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Key  Finding/Results/Accomplishments: 

Enzymes  are  biochemically  classified  as  hydrogenases  if  they  carry  out  either  1).  the  oxidation  of 
hydrogen  to  feed  energy  in  the  form  of  reduced  electrons  into  metabolism  or  2).  the  reduction  of 
protons  to  recycle  reduced  electron  carriers  during  fermentative  metabolism.  These  enzymes  also 
have  a  structural  classification  based  of  their  transition  metal  cofactor  into  one  of  three  classes  - 
[Fe]-hydrogenases,  [NiFe]-hydrogenases,  and  [FeFe] -hydrogenases.  The  first  of  these  produces 
hydrogen  gas  through  dehydrogenation  of  methylene-tetrahydromethanopterin,  while  the  latter 
two  both  catalyze  the  oxidation  of  diatomic  hydrogen  or  the  reduction  of  protons  to  hydrogen 
gas.  Of  these  three  classes,  [FeFe]-hydrogenase  has  been  shown  to  be  the  most  effective  in  the 
production  of  hydrogen  gas. 

Fiowever,  [FeFe]-hydrogenases  are  very  oxygen  sensitive  and  become  inactive  when  exposed. 
This  would  require  strictly  anaerobic  reactor  conditions,  which  adds  a  layer  of  complexity  due  to 
oxygen’s  natural  prevalence  in  air.  Furthermore,  oxygenic  photosynthesis,  the  most  effective 
photosynthesis,  is  inherently  incompatible  with  such  a  hydrogen-producing  metabolism  as  it 
introduces  oxygen  internally  to  the  microorganism.  Therefore,  oxygen  tolerance  in  an  [FeFe]- 
hydrogenase  is  highly  desirable. 

Oxygen  attacks  the  Fi-cluster,  the  active  site  inorganic  core  of  [FeFe]-hydrogenases.  The  FF- 
cluster  is  composed  of  a  [4Fe-4S]  subcluster  connected  by  a  cysteine  thiol  to  a  unique  [2Fe-2S] 
subcluster  with  CO  and  CN  ligands.  Environments  that  support  microbial  growth  and  feature  an 
oxygen  gradient,  such  as  bodies  of  water  with  a  sharp  halocline,  may  provide  a  selection  pressure 
to  produce  an  aerotolerant  [FeFe] -hydrogenases.  The  Guerrero  Negro  hyper-saline  ponds  and  the 
Great  Salt  Lake  have  such  characteristics  and  were  sampled  for  [FeFe]-hydrogenase  sequences 
for  this  application.  Variations  may  be  studied  either  by  inserting  a  consensus  sequences  into  a 
well  characterized  [FeFe]-hydrogenase,  such  as  that  of  Clostridium  pasteurianum  (CpF),  or  by  a 
shotgun  approach  of  making  and  screening  insertions  into  CpF  with  a  mixed  sample  of  many 
degenerately  primed  [FeFe]-hydrogenase  sequences  from  a  hyper-saline  environment.  Using 
nature  as  a  guide  may  allow  us  to  overcome  some  of  the  limitations  of  previous  studies,  using 
rational/computational  protein  design  and  random  mutagenesis.  These  studies  have  been  limited 
in  part  due  to  the  enzymes  inherent  properties  associated  with  maturation. 

Fn  conjunction  with  bioprospecting  and  construction  of  [FeFe]-hydrogenase  variants,  various 
supporting  analytical  and  spectroscopic  tests  are  used  to  characterize  the  [FeFe]-hydrogenase  and 
variants.  Assays  have  been  developed  to  measure  the  effect  of  oxygen  on  [FeFe]-hydrogenase 
activity  to  determine  the  half-lives.  Fn  addition,  UV-Visible  spectroscopy  has  been  used  to 
provide  direct  characterization  of  FF-cluster  degradation.  This  technique  can  be  utilized  to 
characterize  interesting  [FeFe]-hydrogenase  mutations  and  natural  variations.  The  [FeFe]- 
hydrogenase  produced  by  Chlamydomonas  reinhardtii  (CrFFydA)  is  particularly  useful  with  this 
method  because  it  lacks  the  iron-sulfur  cluster  containing  ferredoxin-like  domains  that  are 
present  in  CpF.  Therefore,  any  visible  absorption  by  iron-sulfur  clusters  in  this  protein  or  any 
modification  of  it  is  due  to  one  or  both  of  the  active  site  subclusters. 

Key  advances  during  the  2011-2014  funding  period  include:  (a)  We  have  established  a  battery  of 
hydrogenase  specific  activity  assays  and  FF  cluster  spectroscopic  probes  to  assess  oxygen 
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tolerance  in  a  quantitative  manner.  Hydrogen  oxidation  and  hydrogen  production  assays 
conducted  at  specific  time  points  during  oxygen  exposure  allow  for  the  determination  of  a  half- 
life  as  an  indicator  of  relative  oxygen  tolerance.  Comparative  examination  of  the  [FeFe]- 
hydrogenases  from  Chlamydomonas  reinhardtii  and  Clostridium  pasteurianum  indicate 
significantly  different  levels  of  oxygen  tolerance  with  the  half  life  of  the  C.  pasteurianum  having 
a  significant  longer  half  life  and  higher  oxygen  tolerance  that  C.  reinhardtii  in  line  with  the 
results  of  previous  studies,  (b)  We  have  characterized  the  changes  in  microbial  community 
structure  and  hydrogenase  diversity  as  a  function  of  the  vertical  transect  of  the  water  column  in 
the  Great  Salt  Lake.  The  findings  reveal  significant  changes  in  community  structure  as  a 
function  of  changing  salinity  and  large  hydrogenase  diversity  in  the  transition  zone  between  the 
oxic  and  anoxic  layers  as  well  as  the  anaerobic  water  and  benthic  layers,  (c)  We  have  tested  the 
hypothesis  that  sequence  variations  observed  through  the  GSL  water  column  include  structural 
determinants  for  oxygen  tolerance.  Using  both  single  amino  acid  substitution  and  cassette 
mutagenesis  to  probe  the  effect  of  observed  natural  sequence  variations  on  oxygen  tolerance  and 
through  single  amino  acid  substitutions  experiments  have  been  able  to  single  sites  for  which 
sequence  variations  result  in  clear  differences  in  oxygen  tolerance,  (d)  In  collaboration  with  the 
Posewitz  group  we  have  identified  and  characterized  a  strain  of  Tetraselmis,  Tetraselmis  GSL1, 
that  has  robust  hydrogen  producing  growth  characteristics  that  the  model  organism 
Chlamydomonas  reinhardtii  including  growth  under  halophilic  conditions,  (e)  We  have 
analyzed  data  on  the  distribution  of  16S  ribosomal  RNA  genes  and  HydA,  [FeFe] -hydrogenase 
structural,  genes  in  the  water  column  of  the  Great  Salt  Lake  in  collaboration  with  the  Posewitz 
group  and  have  some  interesting  insights  into  the  distribution  of  HydA  in  the  water  column  and 
the  influence  of  light  intensity  and  oxygen  concentration,  (f)  We  have  been  examining  specific 
sequence  variations  in  the  region  of  the  active  site  H  cluster  and  putative  gas  channels  that 
correlate  with  differences  in  oxygen  concentration  in  the  water  column  and  by  site-specific 
amino  acid  substitution  testing  potential  structural  determinants  for  greater  oxygen  tolerance,  (g) 
We  have  in  collaboration  with  Paul  King  and  David  Mulder  at  the  National  Renewable  Energy 
Laboratory  (NREL)  where  we  have  been  examining  the  mechanism  of  oxygen  inactivation  of 
[FeFe]-hydrogenases.  We  are  interested  in  the  mechanism  of  oxygen  inactivation  to  provide 
additional  insights  into  mechanisms  to  prevent  it.  We  have  concentrated  our  efforts  on 
characterizing  the  spectroscopic  properties  and  structural  features  of  [FeFe]-hydrogenases 
exposed  to  controlled  amounts  of  oxygen.  The  results  of  UV/Vis,  EPR,  and  IR  spectroscopic 
characterization  indicate  the  initial  site  of  oxygen  attack  is  the  2Fe  subcluster  that  becomes 
modified  and  then  becomes  depleted  relative  to  the  2[4Fe-4S]  subcluster.  We  have  shown  that 
oxygen  inactivated  hydrogenase  can  be  reactivated  in  vitro  through  addition  of  new  2Fe 
subcluster  by  the  activity  of  HydF  produced  in  a  genetic  background  having  both  HydE  and 
HydG  in  the  presence  of  a  strong  reducing  agent.  Structural  characterization  of  inactive 
hydrogenase  and  intermediates  supports  that  oxygen  inactivation  occurs  through  modification 
and  depletion  of  the  2Fe  subcluster  and  shows  that  the  free  cysteine  thiol  implicated  in  proton 
transfer  has  been  oxidized.  Presumably  this  explains  in  part  the  requirement  for  strong  reducing 
conditions  for  activation.  These  observations  provide  significant  new  insights  into  activation  and 
challenge  the  established  paradigms,  (h)  We  have  sequenced  the  genome  of  the  model  hydrogen 
producing  fennentative  organism  Clostridium  pastuerianum  and  conducted  preliminary 
transcriptional  analysis.  This  organism  expresses  multiple  [FeFe]-hydrogenases  presumed  to 
function  in  hydrogen  production  and  hydrogen  oxidation  in  response  to  different  physiological 
conditions.  The  hypothesis  based  on  mainly  biochemical  data  was  that  one  [FeFe]-hydrogenase 
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functioned  in  proton  reduction  during  sugar  fermentation  to  recycle  reduced  electron  carriers. 
Also  a  model  diazotroph,  C.  pasteurianum,  is  relieved  of  the  burden  of  recycling  electron 
carriers  when  fixing  nitrogen  and  actually  would  benefit  from  capturing  the  reducing  equivalents 
produced  as  hydrogen  during  the  nitrogenase  reaction.  Presumably  a  proton  reducing  [FeFe]- 
hydrogenase  is  produced  under  non  nitrogen  fixing  conditions  and  under  nitrogen  fixing 
conditions,  this  hydrogenase  is  down  regulated  in  lieu  of  a  second  [FeFe]-hydrogenase  that  is 
unregulated  and  functions  in  hydrogen  oxidation.  Through  sequence  analysis  and  subsequent 
transcriptional  analysis,  we  aimed  to  identify  the  hydrogenase  involved  in  hydrogen  oxidation 
during  nitrogen  fixation  and  through  sequence  comparison  identify  potential  structural 
determinants  (sequence  motifs)  that  define  superior  proton  reduction  or  hydrogen  oxidation 
activity  (or  in  others  words  the  motifs  that  control  the  reaction  equilibrium  by  effecting  the 
proton  donation  and/or  electron  transfer  reactions).  We  have  completed  this  study  and  have 
identified  the  hydrogen  oxidizing  hydrogenase  and  have  provided  new  insights  into  both 
hydrogen  metabolism  and  nitrogen  fixation  in  C.  pasteurianum  and  have  identified  several 
features  of  sequence  variation  that  could  potentially  fine  tune  enzymes  toward  proton  reduction 
and  hydrogen  oxidation  activities. 

The  results  of  studies  from  the  three  year  funding  period  are  detailed  either  in  published  works, 
submitted  manuscripts,  or  draft  manuscripts.  Submitted  manuscripts  and  draft  manuscripts  not 
yet  published  are  attached  in  an  appendix. 
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Abstract 

Although  significant  advances  in  H2  photoproduction  have  recently  been  realized  in  fresh  water  algae  (e.g.  Chlamydomonas 
reinhardtii),  relatively  few  studies  have  focused  on  H2  production  and  hydrogenase  adaptations  in  marine  or  halophilic 
algae.  Salt  water  organisms  likely  offer  several  advantages  for  biotechnological  H2  production  due  to  the  global  abundance 
of  salt  water,  decreased  H2  and  02  solubility  in  saline  and  hypersaline  systems,  and  the  ability  of  extracellular  NaCI  levels  to 
influence  metabolism.  We  screened  unialgal  isolates  obtained  from  hypersaline  ecosystems  in  the  southwest  United  States 
and  identified  two  distinct  halophilic  strains  of  the  genus  Tetraselmis  (GSL1  and  QNM1)  that  exhibit  both  robust 
fermentative  and  photo  H2-production  activities.  The  influence  of  salinity  (3.5%,  5.5%  and  7.0%  w/v  NaCI)  on  H2  production 
was  examined  during  anoxic  acclimation,  with  the  greatest  in  vivo  H2-production  rates  observed  at  7.0%  NaCI.  These 
Tetraselmis  strains  maintain  robust  hydrogenase  activity  even  after  24  h  of  anoxic  acclimation  and  show  increased 
hydrogenase  activity  relative  to  C.  reinhardtii  after  extended  anoxia.  Transcriptional  analysis  of  Tetraselmis  GSL1  enabled 
sequencing  of  the  cDNA  encoding  the  FeFe-hydrogenase  structural  enzyme  (FIYDA)  and  its  maturation  proteins  (FIYDE, 
FIYDEF  and  FIYDG).  In  contrast  to  freshwater  Chlorophyceae,  the  halophilic  Tetraselmis  GSL1  strain  likely  encodes  a  single 
FIYDA  and  two  copies  of  FIYDE,  one  of  which  is  fused  to  FIYDF.  Phylogenetic  analyses  of  FIYDA  and  concatenated  FIYDA, 
HYDE,  HYDF  and  HYDG  in  Tetraselmis  GSL1  fill  existing  knowledge  gaps  in  the  evolution  of  algal  hydrogenases  and  indicate 
that  the  algal  hydrogenases  sequenced  to  date  are  derived  from  a  common  ancestor.  This  is  consistent  with  recent 
hypotheses  that  suggest  fermentative  metabolism  in  the  majority  of  eukaryotes  is  derived  from  a  common  base  set  of 
enzymes  that  emerged  early  in  eukaryotic  evolution  with  subsequent  losses  in  some  organisms. 
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Introduction 

The  phylogenetically  unrelated  NiFe-  and  FeFe-hydrogenases 
have  convergently  evolved  to  catalyze  the  reversible  reduction  of 
protons  to  H2  (2H++2e-<  =  >H2)  [1].  Several  recent  studies  have 
documented  the  diversity  of  hydrogenase-encoding  genes  in 
environments  that  span  a  broad  range  of  geochemistry  [2-8].  In 
some  systems,  e.g.,  terrestrial  or  marine  hydrothermal  communi¬ 
ties,  H2  oxidation  has  been  suggested  to  represent  the  primary 
mechanism  of  energy  conservation  [9],  [10],  Yet,  in  other  systems, 
e.g.,  terrestrial  or  intertidal  phototrophic  communities,  H2 
evolution  appears  to  be  of  critical  importance  to  the  functioning 
of  the  assemblage,  in  particular  at  night  when  the  systems  become 
net  sources  of  H2  [11-13]. 

Biological  H2  production  requires  low-potential  reducing 
equivalents  derived  from  either  fermentative  pathways  that  oxidize 
fixed  carbon  (typically  carbohydrates),  or  from  photosynthetic 


pathways  [14-19],  Several  eukaryotic  algae  generate  fermentative 
H2  during  dark,  anoxic  acclimation  as  part  of  a  suite  of 
fermentative  pathways  that  catabolize  carbohydrates  to  alcohols, 
organic  acids  and  H2,  which  are  secreted.  These  metabolites  likely 
provide  a  rich  source  of  carbon  building  blocks  and  reducing 
equivalents  to  organisms  inhabiting  ecological  niches  adjacent  to 
the  algae,  which  are  responsible  for  the  majority  of  primary 
productivity  during  the  day.  Algae  are  also  able  to  use  reducing 
equivalents  from  the  photosynthetic  electron  transport  chain 
under  some  conditions  to  directly  reduce  hydrogenases  at  the  level 
of  ferredoxin  without  the  input  of  ATP,  a  pathway  that  is 
theoretically  regarded  as  the  most  efficient  biological  means  to 
transform  the  energy  in  sunlight  to  H2  for  biotechnological 
applications  [19-23],  To  date,  only  FeFe-hydrogenases  have  been 
unambiguously  identified  in  algae,  with  organisms  such  as 
Chlamydomonas  reinhardtii  encoding  truncated  enzymes  with 
only  the  catalytic  H-cluster;  a  4Fe4S  cluster  linked  via  a  bridging 
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cysteine  to  a  two  Fe  center  coordinated  by  CN  /CO  ligands  and 
a  bridging  dithiolate.  In  contrast,  Chlorella  variabilis  NC64A 
encodes  two  hydrogenase  enzymes  with  both  the  H-cluster 
catalytic  domain  and  an  F-cluster  domain  that  coordinates 
additional  FeS  clusters  that  putatively  function  in  electron  transfer 
[18],  [19],  [24—26], 

Despite  widespread  interest  in  algal  Flo  production,  contempo¬ 
rary  research  is  focused  almost  exclusively  on  freshwater  species  of 
Chlamydomonas,  Scenedesmus,  and  Chlorella,  with  C.  reinhardtii 
being  the  model  system  for  the  vast  majority  of  algal  hydrogenase 
research  [27],  [28],  Significant  advances  in  H2  photoproduction 
from  C.  reinhardtii  have  recendy  been  reported  [29-33];  whereas 
relatively  few  studies  have  examined  H2  production  from  marine 
or  halophilic  algae  [34—36], 

Flalophilic  algae  offer  several  advantages  for  large-scale  algal  FI2 
production.  First,  the  use  of  halophilic  algae  will  enable  H2 
production  in  readily  available  salt  water,  thereby  minimizing  the 
potential  use  of  limited  fresh  water  resources.  Secondly,  gas 
solubility  is  reduced  in  aquatic  saline  systems  [37],  which  is 
potentially  advantageous  because  the  levels  of  soluble  02,  a  potent 
inhibitor  of  most  FeFe-hydrogenases  [38],  are  diminished  in  salt 
water  relative  to  fresh  water.  Hydrogen  is  also  less  soluble  in  saline 
systems  and  therefore  more  easily  removed.  Lastly,  ATP  hydrolysis 
is  increased  in  saline  media  to  maintain  proper  cellular  ion 
gradients.  Hydrogenase  activity  is  an  ATP  neutral  process,  and 
photosynthetic  electron  transport  has  been  shown  to  decline  as 
photosynthetic  ATP  levels  rise  during  H2  photoproduction  in 
some  freshwater  algae  [39],  [23].  Salt  stress  represents  a  potential 
mechanism  to  alleviate  ATP  accumulation,  which  can  inhibit  the 
metabolic  pathways  supplying  reductant  to  hydrogenase. 

To  further  expand  our  understanding  of  H2  metabolism  in 
extreme  environments,  and  to  potentially  discover  novel  organisms 
with  enzymes  that  have  superior  biotechnological  attributes,  we 
initiated  an  effort  to  isolate  hydrogen-producing  organisms  from 
hypersaline  ecosystems.  Although  salt  water  systems  are  ubiquitous 
around  the  world,  relatively  little  is  known  regarding  H2 
metabolism  in  saline  systems,  or  whether  unique  enzyme  features 
are  necessary  for  activity  in  halophiles  [40],  [41], 

Methods 

Algal  strains  and  growth  conditions 

Halophilic  algae  from  the  genus  Tetrasehnis  (Chlorophyceae) 
were  isolated  from  water  samples  collected  at  (a)  a  roadside  pool 
bordering  the  Great  Salt  Lake  (GSL),  Utah,  USA,  with  a  salinity  of 
6.0%  w/v  (Tetraselmis  GSL1)  and  (b)  a  hypersaline  pond  with  a 
conductivity  reading  of  660  mS/ cm  near  Quemado,  New  Mexico, 
USA  [42]  (Tetraselmis  QNM1).  GSL  samples  were  taken  in 
conjunction  with  the  Great  Salt  Lake  Institute,  which  holds  a 
permit  for  sampling  on  Utah  state  lands,  granted  by  the  Utah 
Department  of  Natural  Resources,  Division  of  Forestry,  Fire  and 
State  Lands.  Water  sampling  at  the  New  Mexico  site  was 
conducted  on  public  lands  administered  by  the  Bureau  of  Land 
Management,  and  did  not  require  a  permit  nor  involve 
endangered  or  protected  species.  Both  Tetraselmis  isolates  have 
been  submitted  to  the  University  of  Texas  (UTEX)  algal  culture 
collection. 

Isolates  were  routinely  cultured  in  f/2  medium  amended  with 
Booth  Bay  sea  water  (National  Center  for  Marine  Algae  and 
Microbiota,  NCMA),  at  pH  8.0  [43]  and  3.5%  of  salinity.  Where 
indicated,  salinities  were  increased  to  5.5%  or  7.0%  w/v  with 
NaCl  addition.  Cultures  were  maintained  at  29°C,  without 
agitation  at  a  constant  illumination  of  ~30  pmol  m  2  s  1  of 
photosynthetically  active  radiation  (PAR)  by  cool-white  fluorescent 


lights.  Cell  counts  were  assessed  using  a  Z2  Coulter  cell  and 
particle  counter  (Beckman-Coulter). 

C.  reinhardtii  strain  CC124  (nit-,  mt-)  was  obtained  from  the 
Chlamydomonas  Genetic  Center  (http://www.chlamy.org/)  and 
grown  in  tris-acetate-phosphate  (TAP)  medium  pH  7.2  [44],  and 
shaken  at  1 20  rpm  under  constant  fluorescent  irradiance  (80  pmol 
m  2  s  1  of  PAR  by  GE  Ecolux  6500K  T5  bulbs).  Cells  were 
harvested  during  mid-logarithmic  growth  (16—20  |tg  Chl/ml)  for 
measuring  in  vitro  hydrogenase  activities  and  fermentative  H2 
production. 

Chlorophyll  and  total  protein  determination 

Total  chlorophyll  was  determined  spectrophotometrically  by 
extraction  in  100%  methanol  for  the  Tetraselmis  strains  and  95% 
ethanol  for  C.  reinhardtii  [44],  Tetraselmis  cells  were  washed  with 
2  volumes  of  MilliQ  water  to  remove  salt  prior  to  pigment 
extraction.  Total  chlorophyll  was  selected  as  the  standard  in 
normalizing  hydrogenase  activity  with  the  light-absorbing  capacity 
of  isolates. 

Total  protein  content  was  analyzed  using  a  Modified  Lowry 
Protein  Assay  (Pierce)  according  to  die  manufacture’s  instructions. 
Cell  pellets  were  washed  with  2  volumes  of  MilliQ  water  and  then 
solubilized  in  0.5%  SDS.  Total  protein  was  quantified  using  BSA 
solubilized  in  0.5%  SDS  to  generate  a  standard  curve. 

Anaerobic  induction 

For  H2-photoproduction  measurements,  75  ml  of  mid-log  phase 
cell  cultures  were  grown  at  the  indicated  NaCl  levels  (3.5%,  5.5% 
and  7.0%  w/v),  concentrated  by  centrifugation  at  3716  x  g  for 
15  min  at  25°C,  resuspended  in  1  ml  of  f/2  medium  (at  the  same 
salinity  used  for  culturing)  and  transferred  to  16-ml  glass  serum 
vials  covered  with  aluminum  foil  to  exclude  light.  For  fermentative 
metabolite  analyses,  dark  H2-production  and  C02-evolution 
measurements,  50  ml  of  mid-log  phase  cell  cultures  were 
harvested,  resuspended  in  1  ml  of  fresh  medium,  and  transferred 
to  16-ml  glass  serum  vials  covered  with  aluminum  foil.  For  in  vitro 
hydrogenase  activity  assays  (see  below),  1  ml  of  mid-log  liquid  cell 
cultures  was  directly  transferred  to  anaerobic  vials.  For  C. 
reinhardtii,  in  vitro  hydrogenase  activity  and  fermentative  H2 
production,  5  ml  of  cells  from  mid-log  cultures  were  harvested  by 
centrifugation  (3190  x  g  for  1 0  min)  and  resuspended  in  1  ml  of 
anaerobic  induction  buffer  (AIB;  50  mM  potassium  phosphate 
buffer,  pH  7.2,  3  mM  MgCL),  then  transferred  to  13-ml  glass 
serum  vials  covered  with  aluminum  foil.  All  vials  were  sealed  with 
septa,  purged  for  30  min  with  ultra-high  purity  argon  and  kept 
sealed  in  the  dark  for  4  h  or  24  h,  as  indicated. 

H2-photoproduction  measurements 

Maximum  in  vivo  H2-photoproduction  rates  were  determined 
using  a  custom-built  Ag/AgCl  polarographic  electrode  system 
(ALGI).  H2-  and  02-photoproduction  rates  were  measured 
simultaneously  with  two  YSI  5331 A  electrodes  (YSI  Incorporated, 
Yellow  Springs,  OH,  USA),  poised  at  +/  —  0.6  V,  in  a  water- 
jacketed  (25°C)  assay  chamber.  02  and  H2  electrodes  were 
calibrated  between  each  measurement  using  f/2  medium  (at  the 
respective  salinity  of  tested  cells)  saturated  by  atmospheric  02  or 
by  ultra-high  purity  5.3%  H2  (Ar  balance),  respectively.  Pure  Ar 
purging  was  used  to  determine  electrode  baselines  and  to  sparge 
the  assay  chamber  containing  0.8  ml  of  f/2  medium  of  02.  0.2  ml 
of  4  h  and  24  h  anaerobically  induced  cells  were  then  introduced 
into  deoxygenated  buffer  in  the  sample  chamber.  After  a  30  s  dark 
acclimation,  cells  were  illuminated  for  30  s,  using  saturating  LED 
(Luxeon  III  Star,  Lumileds)  irradiance  of  2000  pmol  photons  m  2 
s  .  H2-photoproduction  rates  were  calculated  from  the  initial 
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slope  during  the  first  1 0  s  of  illumination  from  the  H2-dependent 
current  increase. 

In  vitro  hydrogenase  activity 

In  vitro  Tetraselmis  GSL1  and  QNM1  ^-production  activity 
assays  were  performed  by  transferring  1  ml  of  2X  methyl  viologen 
(MV)  buffered  solution  (10  mM  MV,  50  mM  potassium  phos¬ 
phate,  pH  6.9,  and  0.2%  triton  X-100)  and  0.2  ml  of  reduced 
sodium  dithionite  solution  (100  mM  in  30  mM  NaOH)  to  1  ml  of 
cells  grown  in  f/2  at  the  indicated  salinities  and  acclimated  to 
anoxic  conditions  in  f/2  medium  at  the  same  salt  levels  used  for 
culturing  for  4  h  or  24  h  in  sealed,  Ar-purged  16-ml  vials.  For  the 
data  in  Table  SI,  the  in  vitro  hydrogenase  activity  protocol  was 
modified  for  direct  comparison  of  Tetraselmis  in  vitro  hydrogenase 
activity  to  the  in  vitro  hydrogenase  activity  from  the  fresh  water 
alga  C.  reinhardtii  using  final  MV  assay  solutions  that  were 
equivalent  in  salt  concentrations  for  the  different  species.  To 
obtain  the  same  final  assay  buffer  composition,  C .  reinhardtii  cells 
grown  in  TAP  medium,  were  resuspended  in  1  ml  of  AIB  and 
acclimated  to  anoxic  conditions  for  4  or  24  h  in  sealed,  Ar-purged 
13-ml  vials.  1  ml  of  either  a  2X  MV  solution  (10  mM  MV, 

50  mM  potassium  phosphate,  pH  6.9,  and  0.2%  triton  X-100), 
used  as  a  control,  or  2X  MV-f/2  adjusted  solutions  (10  mM  MV, 

50  mM  potassium  phosphate,  pH  6.9,  and  0.2%  triton  X-100 
dissolved  in  f/2  medium  and  adjusted  to  3.5%,  5.5%  or  7.0% 
NaCl)  was  added  to  measure  hydrogenase  activity.  To  1  ml  of  the 
Tetraselmis  cells  grown  and  anaerobically  induced  in  f/2  medium 
at  3.5%,  5.5%  or  7.0%  NaCl,  1  ml  of  2X  MV-AIB  (10  mM  MV, 
100  mM  potassium  phosphate,  pH  6.9,  3  mM  MgCl2)  was  added. 

0.2  ml  of  reduced  sodium  dithionite  (100  mM  in  30  mM  NaOH) 
was  then  added  to  initiate  hydrogenase  mediated  H2  production  in 
all  assays.  The  reactions  were  incubated  (37°C)  in  a  shaking  water 
bath  and  H2  evolution  measured  for  C.  reinhardtii,  and  the 
Tetraselmis  strains  by  gas  chromatography  (GC)  using  a  Hewlett 
Packard  Series  II  5890  instrument  fitted  with  a  Restek  5  A 
Molecular  Sieve  80/100  6'  1/8"  column  and  a  diermal 
conductivity  detector  with  Ar  as  the  carrier  gas.  The  resulting 
signal  was  integrated  using  ChemStation  software  and  H2  was 
quantified  using  a  standard  curve  generated  from  known 
quantities  of  H2. 

Dark  fermentative  H2,  C02  production 

Following  acclimation  of  cells  to  dark,  anoxic  conditions  for  4  or 
24  h,  H2  production  was  measured  by  withdrawing  0.2  ml  of 
headspace  gas  and  analyzing  by  GC  for  both  C.  reinhardtii  and 
the  Tetraselmis  strains.  C02  production  was  also  measured  from 
cells  acclimated  to  dark,  anoxic  conditions  (0.5,  4  and  24  h).  To 
sealed  vials  containing  acclimated  cells,  1  ml  of  1  M  HC1  was 
added.  The  acidified  cell  suspension  was  then  shaken  vigorously  to 
liberate  C02  into  the  vial  headspace,  and  C02  (0.2  ml  injection) 
was  quantified  by  GC  (Hewlett  Packard  5890  series  II)  using  a 
Supelco  column  (80/100  PORAPAK)  coupled  to  a  diermal 
conductivity  detector  using  He  as  the  carrier  gas.  The  resulting 
signal  was  integrated  using  ClassVP  software,  and  the  gas  was 
quantified  using  a  standard  curve  generated  from  known 
quantities  of  C02. 

Dark  fermentative  metabolite  analyses  and  intracellular 
sugar  content 

After  4  or  24  h  of  dark-anoxic  acclimation,  fermentative 
products  were  analyzed  by  high  pressure  liquid  chromatography 
(HPLC).  Samples  were  collected  after  centrifugation  of  acclimated 
cells  (3  min  at  17000  x  g)  and  the  resulting  supernatant  filtered 
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through  a  silicon  filter  (0.45  pm).  Filtrate  (25  pi)  was  injected 
(thermostated  sample  tray  held  at  10°C)  onto  an  HPLC  column 
(fermentation  monitoring  column  (BioRad),  150x7.8  mm,  8  mM 
H2S04  as  eluent,  flow  rate  of  0.5  ml/min,  column  operating 
temperature  of  45  °C,  refractive  index  (RI)  detector  operating 
temperature  50°C,  in  parallel  with  a  photodiode  array  detector 
(PAD)  at  210  nm).  Resulting  signals  were  integrated  and 
metabolites  quantified  using  a  standard  curve  generated  from 
standards  for  each  metabolite  detected.  The  remaining  cell  pellet 
was  used  for  determining  cellular  dry  weights  and  the  reducing 
carbohydrate  content.  Cell  pellets  were  resuspended  in  the  same 
volume  of  MilliQ,  water,  with  100  pi  used  for  total  quantification 
by  the  anthrone  assay  as  described  previously  [25],  and  the 
remaining  cells  used  for  dry  weight  determination. 

Genomic  DNA,  total  RNA  isolations  and  sequencing 

Genomic  DNA  from  Tetraselmis  GSL1  and  Tetraselmis 
QNM1  was  isolated  from  15  ml  of  cell  culture  using  a 
phenol:  chloroform  extraction  protocol  as  described  previously 
[45].  Total  RNA  was  isolated  from  20  ml  of  100-fold  concentrated 
mid-log  Tetraselmis  GSL1  cells  grown  in  f/2  at  3.5%  salinity  and 
anaerobically  induced  in  60  ml  sealed,  Ar-purged  vials  for  4  h. 
Fermentative  H2  production  was  confirmed  by  GC  before  RNA 
isolation.  Briefly,  a  cell  pellet  was  obtained  by  centrifugation  (3716 
x  g  for  10  min  at  25  °C),  washed  with  2  volumes  of  MilliQ  water 
followed  by  centrifugation,  then  resuspended  in  5  ml  of  Plant 
RNA  Reagent  (Invitrogen)  with  RNA  isolated  according  to  the 
manufacturer’s  instructions  for  small  scale  RNA  isolation,  with  the 
exception  that  all  reagent  volumes  were  multiplied  by  a  factor  of 
10.  Total  RNA  was  concentrated  using  the  RNeasy  MinElute 
Cleanup  Kit  (Qiagen),  according  to  the  manufacturer’s  instruc¬ 
tions.  RNA  quality  was  confirmed  via  2%  agarose  gel  electropho¬ 
resis  and  quantified  using  a  NanoDrop®  ND-1000  (Thermo 
scientific),  as  well  as  by  fluorescence  using  Quant-iT™  Ribo- 
Green®  RNA  Reagent  and  Kit  (Invitrogen).  All  purification  steps 
were  done  with  RNase  free  reagents. 

Total  RNA  was  submitted  for  sequencing  to  the  National 
Center  for  Genome  Resources  (NCGR,  Santa  Fe,  NM)  as  part  of 
the  Marine  Microbial  Eukaryote  Transcriptome  Project  (Gordon 
and  Betty  Moore  Foundation).  RNA  libraries  were  prepared  from 
total  RNA  isolated  using  the  standard  Tru-Seq™  RNA  protocol  - 
poly-A+  selection  (Illumina)  with  an  insert  size  of  ~200  bp  and 
sequenced  from  both  ends  (paired-end  reads  2x50-nt)  on  the 
Illumina  Hi-Seq  2000.  The  total  sequence  data  generated  for  each 
sample  (MMETSP0419_1  &  2)  was  approximately  2.5  Gbp. 
Reads  were  filtered  for  quality  (QJ5)  and  read  length  (0.5). 

Transcriptome  Assembly 

AbySS  [46]  was  used  to  generate  20  assemblies  across  a  k-mer 
sweep  from  26  to  50  nt.  These  assemblies  were  then  subjected  to 
gap  closing  with  GapCloser  v  1.10  (Short  Oligonucleotide  Analysis 
Package,  SOAPdenovo  [47]).  Gap  closed  assemblies  from  the  k- 
mer  sweep  were  reconciled  into  one  assembly  using  miraEST  [48] . 

Identification  of  hydrogenase  structural  and  maturation 
genes 

Hydrogenase  structural  and  maturation  genes  within  the 
Tetraselmis  GSL1  transcriptome  assembly  were  identified  using 
tBLASTn  (NCBI).  HYDA,  HYDEF  and  HYDG  from  Chlamy- 
domonas  reinhardtii  (CAC83731,  AAS92601,  AAS92602,  respec¬ 
tively)  and  Chlorella  variabilis  NC64A  sequences  (AEA34989, 
EFN57384,  EFN57653,  respectively)  were  used  as  search  queries. 
Assemblies  of  all  identified  homologs  were  manually  inspected. 
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RNAseq  reads  were  mapped  back  to  these  assemblies  using 
GSNAP  (Genomic  Short-read  Nucleotide  Alignment  Program)  to 
determine  coverage  and  transcript  variation.  Mapped  reads  were 
reassembled  with  Geneious  (v.5.5.7,  Biomatters)  to  validate  the 
AbySS  assembly. 

To  verify  the  presence  of  an  unfused  HYDE  gene,  PCR 
amplification  of  genomic  Tetraselmis  GSL1  DNA  was  used  to 
amplify  a  region  of  the  unfused,  independent  HYDE  from  the  3'- 
coding  region  (EU-F1  5'CGACAAGAAGGCCCACCTG- 
GAGA3')  to  the  3'UTR  of  HYDE  (EU-R1  5' 
GCGTACCTCGCCTGCCCTTACTA  3'),  which  spans  die 
unfused  HYDE  stop  codon.  To  verify  the  presence  of  the  HYDEF 
fusion  in  gDNA,  primers  corresponding  to  the  3'-region  of  the 
HYDE  segment  in  the  HYDEF  fusion  (EF-F1  5' 
GTCCCGCTACCTTGTCCGCATTG  3')  and  to  the  5'-region 
of  HYDF  in  HYDEF  (EF-R2  5'GAATGTGTGCC- 
GAGCTGTGCT  3')  were  used,  which  spans  the  HYDEF  fusion 
sequence  and  does  not  contain  a  stop  codon  after  the  HYDE  gene 
segment.  PCR  conditions  included:  polymerase  activation:  95°C 
2  min,  40  cycles:  denaturing  95°C  20  s,  annealing  60°C  20  s, 
elongation  72°C  30  s,  and  a  final  elongation  at  72°C  for  5  min 
using  the  KOD  DNA  polymerase  and  master  mix  (Promega). 
Amplified  products  were  sequenced  using  the  Davis  DNA 
sequencing  facility  (Davis  Sequencing).  All  retrieved  sequences 
have  been  deposited  in  GenBank  under  the  accession  numbers: 
KC820787-KC820794,  and  KC832401. 

Molecular  phylogeny 

DNA  was  isolated  from  algal  colonies  grown  on  agar  using  the 
UltraClean  Soil  DNA  Isolation  Kit  (MoBio  Laboratories).  Partial 
18S  SSU  rRNA  genes  (>1350  and  1120  bp,  respectively),  for 
Tetrasehnis  GSL1  and  Tetraselmis  QNM1  were  amplified  and 
sequenced  from  each  alga  using  the  eukaryote-specific  primers 
360FE  and  1492RE  [49].  Sequences  were  aligned  using  the  NCBI 
BLAST  algorithm  queried  against  the  nr/nt  database.  The 
complete  Tetraselmis  GSL1  18S  SSU  rRNA  gene  (1802  bp)  was 
attained  from  transcriptome  data  and  deposited  in  GenBank 
(accession  no.  KC820794). 

The  phylogenetic  position  of  HYDA  relative  to  representatives 
available  in  databases  [50]  was  determined  using  maximum 
likelihood  approaches  similar  to  those  described  previously  [7], 
[51].  Briefly,  HYDA  homologs  were  aligned  using  the  Gonnet 
substitution  matrix  specifying  a  gap  opening  penalty  of  1 3  and  a 
gap  extension  penalty  of  0.05.  The  alignment  block  was  trimmed 
to  contain  only  the  aligned  positions  corresponding  to  the  H- 
cluster  only  homolog,  HYDA1,  from  C.  reinhardtii  (CAC83731). 
The  phylogenetic  position  of  HYDA  from  Tetraselmis  GSL1  was 
determined  using  PhyML  (ver.  3.0)  [52]  specifying  the  LG 
substitution  model  with  gamma  distributed  rate  variation  and  a 
proportion  of  invariable  sites,  as  recommended  by  ProtTest  (ver. 
3.2.1)  [53],  A  HYDA  paralog,  i.e.,  Narl,  from  Saccharomyces 
cerevisiae,  served  as  the  outgroup  in  the  phylogenetic  analysis  of 
HYDA. 

To  further  increase  the  resolution  of  our  phylogenetic  analysis  of 
the  FeFe-hydrogenase  identified  in  the  transcriptome  of  Tetra¬ 
selmis  GSL1,  we  compiled  homologs  of  HYDA,  HYDE,  HYDF, 
and  HYDG  from  all  eukaryotes  with  available  genomes.  All 
homologs  were  aligned  and  trimmed  as  described  above. 
Alignment  blocks  were  concatenated  using  base  functions  within 
PAUP  (ver.  4.0)  [54]  and  the  phylogenetic  position  of  the 
HYDAEFG  concatenated  alignment  was  determined  using 
PhyML  as  described  above.  HYDAEFG  homologs  from  Bacte- 
roides  thetaiotaomicron  VPI-5482,  served  as  the  outgroup  in  this 
phylogenetic  analysis. 
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Results 

Identification  of  hydrogenase  activity  in  halophilic  strains 
of  Tetraselmis 

To  assess  whether  algae  with  hydrogenase  activity  are  present  in 
hypersaline  environments,  water  samples  were  collected  from  a 
variety  of  GSL  sites  with  salinities  ranging  from  3.5—25%,  and 
unialgal  isolates  from  these  water  samples  were  obtained  using 
flow  cytometry.  Hydrogenase  activity  screens  of  over  40  unialgal 
isolates  recovered  from  GSL  using  the  reduced  MV  assay  revealed 
that  the  most  robust  hydrogenase  activity  was  detected  from  a 
tetraflagellate  alga  that  morphology  and  18S  rRNA  gene  analysis 
indicated  to  be  a  novel  strain  of  Tetraselmis  (Tetraselmis  strain 
GSL1),  isolated  from  a  roadside  pool  bordering  GSL  with  a 
salinity  of  6.0%  w/v.  Notable  hydrogenase  activity  was  not 
detected  in  any  of  the  isolated  strains  of  Dunaliella,  which  have 
been  shown  to  be  the  numerically  dominant  alga  in  several  GSL 
environments  [55].  We  also  obtained  an  axenic  culture  of  another 
halophilic  strain  of  Tetraselmis  (Tetraselmis  QNM1),  which  was 
isolated  from  a  hypersaline  pond  in  New  Mexico  and  was  available 
in  the  Colorado  Center  for  Biofuels  and  Biorefining  algal  culture 
collection  [42].  Tetraselmis  GSL1  and  Tetraselmis  QNM1  18S 
rRNA  genes  are  most  similar  to  the  recently  described  Tetraselmis 
indica,  isolated  from  salt  pans  in  Goa,  India  [56].  Both  Tetraselmis 
GSL1  and  Tetraselmis  QNM1  remained  viable  at  NaCl  concen¬ 
trations  greater  than  10%,  but  growth  was  reduced  at  these  NaCl 
levels.  Therefore,  subsequent  characterization  of  hydrogenase 
activities  in  the  Tetraselmis  isolates  were  done  at  a  comparable 
ocean  salinity  of  3.5%,  as  well  as  at  the  increased  salinities  of  5.5% 
and  7.0%  (die  later  concentrations  are  similar  to  the  salinity  where 
this  taxon  was  isolated)  to  determine  the  effects  of  elevated  NaCl 
concentrations  on  hydrogenase  activity.  As  shown  in  Figure  1,  the 
highest  growth  rates  and  cell  densities  are  observed  in  medium 
with  3.5%  NaCl,  with  growth  rates  declining  as  NaCl  concentra¬ 
tion  is  increased.  In  the  case  of  Tetraselmis  QNM1,  cells  reached 
stationary  phase  earlier  when  cultured  at  7.0%  NaCl,  yet  final  cell 
densities  were  approximately  50%  lower. 

Hydrogenase  activity  in  vitro  as  a  function  of  salinity 

Given  previous  data  indicating  that  salt  levels  impact  algal 
metabolism  and  protein  expression  [41],  [57—60],  hydrogenase 
activity  was  assessed  after  culturing  at  3.5%,  5.5%  and  7.0%  NaCl 
concentrations.  Cells  were  acclimated  in  the  dark  under  anoxic 
conditions  for  4  or  24  h  prior  to  measurement  of  in  vitro 
hydrogenase  activities  using  cell  extracts  and  reduced  MV.  As 
shown  in  Figure  2,  significant  hydrogenase  activity  is  observed  in 
cultures  grown  at  all  three  NaCl  concentrations,  with  a  diminution 
of  total  in  vitro  activity  as  extracellular  NaCl  levels  increase  for 
both  strains.  The  in  vitro  hydrogenase  activity  measured  at  3.5% 
NaCl  is  more  than  an  order  of  magnitude  greater  than  the  in  vitro 
hydrogenase  activity  typically  reported  on  a  chlorophyll  basis  for 
C.  reinhardtii  [61].  As  the  ionic  strength  of  the  assay  buffer  can 
influence  the  measured  in  vitro  hydrogenase  activity  [62],  we 
tested  whether  the  differences  in  hydrogenase  activity  can  be 
attributed  to  the  higher  ionic  strength  used  to  measure  the 
Tetraselmis  MV-mediated  hydrogenase  activity.  As  shown  in 
Table  SI,  when  both  the  Tetraselmis  strains  and  C.  reinhardtii  are 
assayed  using  equivalent  final  MV  assay  solutions  at  variable  salt 
levels,  the  Tetraselmis  strains  still  show  in  vitro  hydrogenase 
activity  that  is  about  an  order  of  magnitude  greater  than  the  rates 
from  C.  reinhardtii,  with  the  greatest  differences  emerging  at  the 
24  h  time  point  (up  to  ~30  fold  at  24  h  and  1.6%  NaCl).  We  also 
assessed  hydrogenase  activity  as  a  function  of  total  protein  levels 
(Table  SI),  and  again  hydrogenase  activity  was  approximately  an 
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Figure  1.  Cell  densities  in  f/2  medium  adjusted  to  3.5%,  5.5% 
or  7.0%  NaCI  (w/v).  Tetraselmis  GSL1  (A)  and  Tetraselmis  QNM1  (B) 
cell  numbers  were  determined  using  a  Beckman  Z2  Coulter  particle 
counter.  Error  bars  represent  standard  deviations  from  6  experimental 
repetitions.  Representative  light  microscope  images  from  cells  grown  at 
3.5%  NaCI  are  inset. 
doi:1 0.1 371/journal.pone.008581 2.g001 
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order  of  magnitude  greater  than  is  observed  from  C.  reinhardtii 
after  24  h  of  anaerobic  induction,  and  ~5  times  higher  at  4  h  of 
anaerobic  induction,  when  the  NaCI  levels  in  the  MV  assay  are 
equivalent. 

Hydrogen  photoproduction  activities  in  vivo 

Hydrogen  photoproduction  was  assayed  using  a  custom-built, 
Clark-type  electrode  [61]  (Fig.  3).  Both  Tetraselmis  strains  showed 
significant  H2-photoproduction  rates  (albeit  ~2-fold  less  than 
those  attained  in  C.  reinhardtii,  when  rates  were  normalized  to 
total  chlorophyll  at  4  h  of  anoxic  acclimation),  indicating  that 
hydrogenase  activity  is  coupled  to  the  photosynthetic  electron 
transport  chain,  as  is  observed  in  several  species  of  the 
Chlorophyceae  [50] .  In  contrast  to  the  results  of  the  MV  in  vitro 
hydrogenase  assays,  H2  photoproduction  in  Tetraselmis  GSL1 
increased  slightly  as  a  function  of  salinity.  Given  that  the  in  vivo 
H2-photoproduction  rates  are  significantly  lower  (~30  fold)  than 
the  measured  in  vitro  (i.e.,  MV  assays)  rates,  excess  enzyme 
capacity  exists  under  these  conditions  relative  to  the  amount  of 
reductant  supplied  by  the  photosynthetic  electron  transport  chain 
for  H2  production.  These  results  suggest  that  although  the  amount 
of  active  enzyme  decreases  as  a  function  of  salinity,  the  efficiency 
of  hydrogenase  coupling  to  the  photosynthetic  electron  transport 
chain  increases.  Interestingly,  in  C.  reinhardtii  and  other  green 
algae,  the  in  vivo  hydrogenase  activity  levels  are  frequently  similar 
to  the  in  vivo  H2-photoproduction  rates  [61],  which  is  not 
observed  in  the  two  Tetraselmis  isolates  studied.  Similar  to  the  in 
vitro  assays,  H2-photoproduction  rates  increased  (~2— 3  fold)  at 
24  h  relative  to  4  h  of  dark,  anoxic  acclimation.  H2-photopro- 
duction  rates  were  higher  (~3M  fold)  than  those  of  C.  reinhardtii 
(Table  SI)  when  normalized  to  total  chlorophyll  following  24  h  of 
dark,  anoxic  acclimation  [61]. 


Figure  2.  Methyl  viologen  mediated  in  vitro  hydrogenase 
activity.  Hydrogen  production  rates  from  anoxic  Tetraselmis  GSL1  (A) 
and  Tetraselmis  QNM1  (B)  cell  extracts  incubated  with  reduced 
dithionite.  Cultures  were  grown  and  then  acclimated  to  dark,  anoxic 
conditions  in  f/2  medium  at  the  indicated  NaCI  levels  (w/v).  Error  bars 
represent  standard  deviations  from  8  experimental  repetitions. 
doi:1 0.1 371/journal.pone.008581 2.g002 

Fermentative  metabolite  analysis 

Fermentative  H,  was  detected  in  the  gas  phase  of  cultures  of 
both  Tetraselmis  GSL1  and  Tetraselmis  QNM1  following  dark, 
anoxic  acclimation  (Fig.  4),  with  H2  yields  being  among  the  highest 
reported  to  date  for  green  algae  (e.g.,  ~20  fold  higher  on  a  per 
chlorophyll  basis  than  those  observed  in  C.  reinhardtii  (Table  SI)). 
Both  Tetraselmis  strains  also  accumulated  C02  in  the  headspace 
(Fig.  4)  during  fermentation,  as  would  be  expected  if  H2 
production  was  linked  to  the  oxidative  decarboxylation  of  pyruvate 
by  pyruvate-ferredoxin  oxidoreducase  with  subsequent  electron 
transfer  from  ferredoxm  to  hydrogenase  [63],  [64],  The  identifi¬ 
cation  of  extracellular  formate,  acetate,  and  ethanol  in  the 
medium  following  acclimation  to  dark,  anoxic  conditions  (Fig.  5) 
suggests  that  these  algae  employ  a  classic  heterofermentation 
pathway,  similar  to  that  which  has  been  observed  in  other  green 
algae  [18].  Metabolite  analyses  indicate  up  to  a  4-fold  increase  in 
fermentative  product  accumulation  at  24  h  of  dark,  fermentation 
relative  to  4  h,  demonstrating  the  ability  of  these  Tetraselmis 
strains  to  maintain  prolonged  anoxic  metabolism. 

Carbohydrate  metabolism 

Total  sugar  content  was  determined  using  the  anthrone  assay 
[25]  since  it  is  likely  that  carbohydrates  serve  as  fermentable 
substrates  in  Tetraselmis.  As  shown  in  Table  1,  concentrations  of 
total  reducing  sugars  increase  as  a  function  of  higher  salinity, 
consistent  with  the  accumulation  of  organic  osmolytes,  as  has  been 
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Figure  3.  In  vivo  H2-photoproduction  rates.  Hydrogen  production 
rates  after  illumination  of  dark,  anoxic  cultures  of  Tetraselmis  GSLI  (A) 
and  Tetraselmis  QNM1  (B).  Bar  graphs  indicate  maximal  initial  H2- 
photoproduction  rates  measured  using  a  custom  built  Clark-type 
electrode.  Cells  were  grown  and  acclimated  to  dark,  anoxic  conditions 
in  f/2  medium  at  the  indicated  NaCI  levels  (w/v).  Error  bars  represent 
standard  deviations  from  10  experimental  repetitions. 
doi:1 0.1 371  /journal.pone.008581 2.g003 


observed  in  other  halophilic  algae  and  cyanobacteria  [41],  [65], 
[66].  During  acclimation  to  dark  anoxia,  carbohydrate  degrada¬ 
tion  was  observed  concomitant  with  fermentative  product 
secretion.  Approximately  30—50%  of  the  carbohydrate  accumu¬ 
lated  was  catabolized  at  the  end  of  24  h  of  anoxia.  Paralleling  the 
increase  in  total  carbohydrate  as  a  function  of  salt  concentration, 
an  increase  in  fermentative  metabolites  is  observed  at  higher  NaCI 
concentrations,  indicative  of  increased  metabolic  flux  through 
fermentation.  Salt-induced  osmotic  stress  has  previously  been 
reported  to  increase  carbohydrate  and  lipid  content  in  green  algae 
and  cyanobacteria,  and  salt-stress  cycling  has  previously  been 
applied  for  improving  biofuel  yields  [65] ,  [66] . 

Transcriptome  analysis  and  hydrogenase  gene  isolation 

Attempts  to  amplify  Tetraselmis  GSLI  HYDA1  from  genomic 
DNA  or  transcript  preparations  using  degenerate  primers  [7]  were 
unsuccessful.  We  therefore  used  whole  transcriptome  sequencing 
from  Tetraselmis  GSLI  after  4  h  of  anoxic  acclimation  for  gene 
discovery  [67].  From  the  transcriptome  dataset,  full  length 
transcripts  with  homology  to  the  hydrogenase  structural  gene 
(HYDA  [24]),  as  well  as  the  hydrogenase  maturases  (HYDE, 
HYDEF,  and  HYDG  [68],  [69])  were  attained.  Only  one  FeFe- 
hydrogenase  gene  (HYDA)  that  encodes  an  H-cluster  domain,  but 
that  lacks  any  F-cluster  domains,  was  identified.  For  all  other  green 


algae  exhibiting  hydrogenase  activity,  and  with  available  tran- 
scriptome/genome  data,  at  least  two  FeFe-hydrogenases  are 
present,  with  Chlorophytes  (e.g.,  Chlamydomonas  and  Scenedes- 
mus)  encoding  H-cluster  only  enzymes  and  Ghlorella  NC64A 
(Trebouxiophyceae)  encoding  two  FeFe-hydrogenases  with  both 
the  H-cluster  domain  and  additional  F-cluster  binding  domains 
[25].  There  are  several  explanations  that  may  account  for  the 
detection  of  only  one  HYDA  homolog:  (a)  Tetraselmis  GSLI 
contains  only  a  single  HYDA  isoform,  which  would  be  the  first 
report  of  a  single  HYDA  isoform  in  a  Chlorophyceaen  alga;  (b) 
other  HYDA  isoform(s)  exist  in  the  Tetrasehnis  GSLI  genome  but 
are  not  expressed  at  a  sufficient  level  for  detection  using  our  assay 
conditions;  or  (c)  other  HYDA  isoform(s)  are  present  in  the 
Tetrasehnis  GSLI  genome  but  are  no  longer  expressed  and 
therefore  not  detectable  by  transcriptome  sequencing. 

Also  unexpected  in  the  transcriptome  data  was  the  finding  that 
a  HYDE  homolog  not  fused  to  HYDF  is  observed  as  an 
independent  transcript.  To  date,  all  other  sequenced  green  algae 
with  hydrogenase  activity  contain  only  a  HYDEF  fusion,  and  this 
is  the  first  time  that  a  transcript  encoding  an  independent  HYDE 
has  been  observed  in  a  green  alga.  Tetraselmis  GSLI  also  has  a 
second  transcript  encoding  a  gene  with  HYDE  homology  that  is 
fused  with  HYDF  (HYDEF).  The  independent  HYDE  transcript 
has  a  3'  untranslated  region  (UTR)  and  translation  stop  site, 
whereas  in  the  HYDEF  fusion,  the  HYDE  portion  of  the  gene  is 
fused  in  frame  with  the  HYDF  coding  sequence  (Fig.  SI).  The 
HYDE  section  of  the  HYDEF  fusion  lacks  both  the  last  19 
nucleotides  found  in  the  independent  HYDE  coding  DNA 
sequence  (CDS),  as  well  as  the  HYDE  3'  UTR.  This  CDS 
deletion  effectively  removes  the  translation  stop  site  and  the  last  6 
amino  acids  at  the  C-terminus  of  HYDE.  An  intronic  sequence  at 
the  3'  end  of  the  HYDE  region  in  HYDEF  allows  an  in  frame 
fusion  with  the  5'  end  of  the  HYDF  gene  (Fig.  SI).  The  presence 
of  a  single  HYDF  independent  of  the  HYDEF  fusion  is  not 
supported  due  to  the  lack  of  transcriptional  evidence  for  an 
independent  HYDF  5 'UTR  and  translation  start  site;  however,  we 
cannot  eliminate  the  possibility  of  a  genomic  copy  of  HYDF  not 
expressed  under  our  conditions. 

The  presence  of  a  single  HYDE  gene,  as  well  as  a  HYDEF 
fusion  was  verified  by  PCR  using  genomic  DNA.  Primers  were 
designed  to  amplify  only  the  HYDE  gene  (sequences  complimen¬ 
tary  to  the  C-terminus  region  of  HYDE  coding  sequence  and  to 
the  HYDE  3'  UTR  downstream  of  the  stop  codon);  and  another 
primer  set  to  amplify  only  the  HYDEF  fusion  (primers 
corresponding  to  the  C-term  region  of  the  HYDE  coding  sequence 
and  the  N-terminus  region  of  the  HYDF  coding  sequence).  We 
confirmed  that  (a)  an  independent  HYDE  gene  (not  fused  to 
HYDF)  with  a  discrete  stop  codon  and  3 'UTR  and  (b)  a  fused 
HYDEF  that  has  neither  the  HYDE  3 'UTR,  nor  a  stop  codon 
after  the  HYDE  coding  sequence  are  encoded  in  the  Tetraselmis 
GSLI  genomic  DNA.  Sequencing  of  a  gDNA  PCR  product  that 
spans  the  region  between  HYDE  and  HYDF  within  the  HYDEF 
fusion  revealed  an  additional  intron  not  present  in  the  indepen¬ 
dent  HYDE  that  directly  connects  to  an  exon  of  the  HYDF  coding 
sequence.  In  sum,  these  data  are  consistent  with  a  simple  fusion 
between  what  was  the  5 '  UTR  of  HYDF  to  the  last  exon  of  a  copy 
of  HYDE,  effectively  eliminating  the  stop  codon  and  3 '  UTR  of 
this  HYDE,  which  generates  a  new  intron  connecting  HYDE  and 
HYDF  coding  sequences.  As  a  negative  control,  primers  were 
designed  corresponding  to  the  3'UTR  of  HYDE  and  the  N- 
terminal  coding  sequence  of  HYDF,  and  as  expected  no  PCR 
product  was  obtained. 

The  relative  expression  of  each  transcript  was  determined  and 
averaged  for  two  biological  replicates  (Table  2).  The  hydrogenase 
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Figure  4.  H2  and  C02  levels  after  dark,  anoxic  acclimation.  H2  (A,  C)  and  C02  (B,  D)  detected  from  sealed  vials  during  dark,  anoxic  acclimation 
of  Tetraselmis  GSL1  (A,  B)  and  Tetraselmis  QNM1  (C,  D).  Cells  were  grown  and  acclimated  to  anoxia  in  f/2  medium  adjusted  to  the  indicated  NaCI 
levels  (w/v).  Error  bars  represent  standard  deviations  from  14  (H2)  and  6  (C02)  experimental  repetitions. 
doi:1 0.1 371  /journal.pone.008581 2.g004 


structural  gene  (HYDA)  had  the  highest  expression  levels  of  all 
hydrogenase  related  genes  and  was  the  16th  most  abundant 
transcript  in  the  entire  transcriptome  assembly,  suggestive  of  a  key 
role  in  the  anoxic  proteome  and  consistent  with  the  substantial 
levels  of  in  vitro  hydrogenase  activity  in  this  alga.  Both  HYDE  and 
HYDG  are  expressed  at  commensurate  levels  with  ~  10-fold  fewer 
transcripts  than  HYDA.  Surprisingly  HYDEF  is  only  expressed  at 
~10%  of  die  levels  of  HYDE  and  HYDG  (100-fold  less  abundant 
than  HYDA). 

Single  nucleotide  polymorphisms  (SNPs)  were  found  in  all  of  the 
hydrogenase  related  transcripts  (Table  3  and  Table  S2).  The 
HYDA  transcript  had  the  most  SNPs  detected  at  39,  while  HYDG 
had  only  a  single  SNP.  The  majority  of  SNPs  are  synonymous 
nucleotide  substitutions  that  result  in  no  amino  acid  change 
(Table  3).  Of  the  34  SNPs  detected  in  the  HYDA  CDS,  only  6  of 
these  result  in  amino  acid  changes,  all  of  which  occur  at  the  N- 
terminus  end  of  the  peptide  and  are  located  outside  of  active  site 
regions.  Genomic  sequencing  of  the  fusion  region  of  HYDEF 
revealed  only  four  SNPs  between  the  HYDE  portion  of  HYDEF 
relative  to  the  independent  HYDE  transcript. 

Phylogeny  of  hydrogenase  and  maturase  enzymes 

We  recently  described  the  presence  of  two  genes  encoding 
HYDA  in  the  green  alga  Chlorella  variabilis  NC64A,  which  is  the 
only  representative  of  the  Chlorophyta  known  to  date  that  has 
FeFe-hydrogenases  containing  F-cluster  domains  [25].  Addition¬ 
ally,  this  study  determined  that  all  algal  hydrogenase  H-clusters 
sequenced  to  date  are  monophyletic,  indicating  a  single  common 
ancestor  for  the  incorporation  of  hydrogenase  activity  in  algal 
metabolism  [25].  We  also  recently  sequenced  the  genome  of  the 
marine  alga  Nannochloropsis  gaditana  (Eustigmatophyceae)  [70] 
and  observed  the  presence  of  a  single  FeFe-hydrogenase  encoding 
both  H-  and  F-cluster  domains  in  this  alga  [71].  As  shown  in 


Figure  6,  rigorous  phylogenetic  analyses  demonstrate  that  the 
recently  obtained  algal  HYDA  sequences  from  Tetraselmis  GSL1 
and  N.  gaditana  are  also  monophyletic  with  the  other  algal  FeFe- 
hydrogenases.  This  is  somewhat  surprising  given  the  evolutionary 
distance  between  Nannochloropsis  and  Tetraselmis,  and  indicates 
a  relatively  early  origin  of  algal  FeFe-hydrogenases,  algal  H2 
metabolism,  and  possibly  algal  fermentation  pathways.  Addition¬ 
ally,  as  the  Nannochloropsis  and  Tetraselmis  hydrogenases  are  the 
first  full-length  marine /halophilic  sequences  available,  this  data 
indicates  that  hydrogenases  in  both  salt-water  and  fresh-water 
algae  are  derived  from  a  common  ancestor. 

As  expected,  HYDA  from  Tetraselmis  GSL1  nested  with  the 
Chlorophyceae,  supporting  the  highest  max  score  and  smallest  e- 
value  obtained  when  aligned  with  C.  reinhardtii  (392  and  9e-136, 
respectively),  compared  to  C.  variabilis  NC64A  enzymes  (367  and 
3e-123)  or  using  blastx  CDS  alignment  (best  four  results 
Chlamydomonas  moewusii,  Volvox  carteri,  Acutodesmus  obli- 
quus,  all  in  the  Chlorophyceae).  Curiously,  while  HYDA 
phylogenetic  analyses  placed  Tetraselmis  GSL1  as  a  sister  clade 
of  the  Chlorophyceae  and  placed  C.  variabilis  NC64A  basal  along 
with  the  more  ancestral  N.  gaditana,  concatemeric  analysis  of 
HYDAEFG  sequences  suggests  a  sister  clade  position  for 
Tetraselmis  and  Chlorella  members;  N.  gaditana  branches  basal 
among  algae  with  genome  sequences  available.  The  discrepancy 
associated  with  the  branching  order  of  Tetraselmis  GSL1  and  N. 
gaditana  HYD  phylogeny  when  either  HYDA  is  analyzed  alone  or 
when  HYDAEFG  are  considered  togedier  may  arise  from 
differences  in  the  number  of  available  sequences  used  to  construct 
the  trees,  leading  to  bias  in  the  concatenated  analysis.  Alterna¬ 
tively,  these  results  may  suggest  slightly  different  evolutionary 
histories  or  selective  pressures  that  have  acted  on  the  structural 
enzyme,  HYDA,  and  the  maturases,  HYDE,  HYDF,  HYDG. 
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Figure  5.  Accumulation  of  selected  metabolites  in  extracellular  medium.  Bar  graphs  indicate  the  levels  of  secreted  formate,  acetate  and 
ethanol  from  cultures  of  Tetraselmis  GSL1  (A,  B,  C)  and  Tetraselmis  QNM1  (D,  E,  F),  respectively.  Cells  were  grown  and  acclimated  to  dark,  anoxic 
conditions  in  f/2  medium  adjusted  to  the  indicated  NaCI  levels  (w/v).  Error  bars  represent  standard  deviations  from  8  experimental  repetitions. 
doi:1 0.1 371  /journal.pone.008581 2.g005 


Previously,  it  was  suggested  that  the  multiple  HYDA  isoforms 
present  in  algal  genomes  are  the  result  of  numerous,  independent 
duplication  events  within  each  individual  lineage  (i.e.,  multiple 
HYDA  isoforms  is  not  an  ancestral  trait)  [25].  The  genome  of  N. 
gaditana  and  the  transcriptome  of  Tetraselmis  GSL1  suggest  that 
each  of  these  strains  encode  a  single  copy  of  HYDA.  When 
coupled  with  HYDA  phylogenetic  data,  which  indicates  that  N. 
gaditana  branches  at  the  base  of  the  algal  lineages,  these  results 
support  the  notion  that  the  progenitor  of  H2  metabolism  in  algae 
encoded  for  only  a  single  HYDA  isoform.  The  observation  that 
HYDA  from  Tetraselmis  GSL1  is  nested  among  algae  with 
multiple  HYDA  isoforms  supports  the  previous  claim  that 
independent  duplication  events  led  to  the  multiple  isoforms 
present  in  these  other  strains. 

Discussion 

Salt  water  systems  have  a  number  of  potentially  intriguing 
applications  in  biological  H2  production  using  phototrophic 


microorganisms  [72].  Saline  water  is  readily  available  and  salt 
concentrations  can  likely  be  leveraged  to  influence  metabolism. 
Because  little  previous  research  has  examined  H2  production  in 
marine  or  halophilic  algae,  we  undertook  an  effort  aimed  at 
identifying  and  characterizing  hydrogenase  activity  in  halophilic 
algae,  and  identified  two  distinct  strains  of  Tetraselmis  with 
fermentative  metabolisms  and  promising  H2-photoproduction 
activities.  To  date,  these  strains  of  Tetraselmis  are  the  most  robust 
H2-producing  phototrophs  isolated  from  high-salt  environments; 
however  it  should  be  noted  that  both  are  relatively  slow  growing. 
Marine  phototrophs  are  responsible  for  a  significant  portion  of 
global  primary  productivity,  can  be  cultured  in  abundant  salt 
water  resources,  and  diverse  species  of  Tetraselmis  are  present  in 
several  saline  ecosystems  [73],  [56].  Moreover,  species  of 
Tetraselmis  have  emerged  as  biotechnologically  relevant  organ¬ 
isms  for  the  production  of  biofuels  such  as  lipids,  ethanol,  or  starch 
and  for  the  extraction  of  higher  value  biocommodities  including 
vitamins  and  ceramides  [74—80],  and  now  H2. 
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Table  1.  Total  reducing  sugars  detected  using  the  Anthrone 
Assay. 


strain  -  salinity 

Time 

jig  tot  sugars/|rg  Chi 

SD 

GSL1  -  3.5% 

0  h 

55.11 

±10.70 

4  h 

50.62 

±9.12 

24  h 

38.00 

±7.45 

GSL1  -  5.5% 

0  h 

63.72 

±14.52 

4  h 

61.13 

±12.07 

24  h 

39.50 

±3.47 

GSL1  -  7.0% 

0  h 

94.73 

±9.25 

4  h 

85.70 

±13.04 

24  h 

49.55 

±8.21 

strain  -  salinity 

Time 

jig  tot  sugars/|rg  Chi 

SD 

QNM1  -  3.5% 

0  h 

58.66 

±4.59 

4  h 

57.09 

±9.54 

24  h 

52.10 

±2.95 

QNM1  -  5.5% 

0  h 

64.43 

±2.12 

4  h 

60.62 

±8.71 

24  h 

52.85 

±4.53 

QNM1  -  7.0% 

0  h 

75.90 

±8.04 

4  h 

51.99 

±11.72 

24  h 

39.95 

±4.59 

Intracellular  reducing  sugar  content  in  cultures  of  Tetraselmis  GSL1  and 
Tetraselmis  QNM1,  grown  and  acclimated  to  dark,  anoxic  conditions  at  the 
indicated  times  and  NaCI  concentrations  (w/v). 
doi:10.1371/journal.pone.0085812.t001 

We  confirmed  that  the  halotolerance  of  Tetraselmis  allows  its 
growth  in  a  wide  range  of  salinities.  The  ability  to  grow  at  ocean 
salinity  (3.5%),  as  well  as  more  saline  environments  makes  these 
organisms  particularly  versatile  for  biotechnological  applications. 
Interestingly,  we  observed  increases  in  both  fermentative  and 
photo-  Hn  production  as  salinity  increased,  despite  decreases  in  in 
vitro  hydrogenase  activities.  Further  studies  are  necessary  to 
uncover  the  mechanistic  underpinnings  of  the  increase  in  H2 
production;  however,  it  is  plausible  that  the  higher  salt  concen¬ 
trations  increase  ATP  demand,  allowing  for  increased  photosyn¬ 
thetic  activity  in  the  case  of  H2  photoproduction,  or  increased 
fermentative  activity  in  the  case  of  dark,  anoxic  Ho  production. 
Both  of  these  pathways  increase  ATP  synthesis/utilization  via 
pathways  that  require  anoxic  terminal  electron  acceptors,  which 
may  explain  the  increases  in  Ho  production.  Although  further 


research  is  necessary  to  verify  or  refute  this  hypothesis,  it  is  clear 
that  salt  can  be  used  to  increase  H2  production  in  these 
Tetraselmis  strains. 

Five  unique  features  of  anoxic  metabolism  that  have  not  been 
obseived  in  more  established  freshwater,  model  systems  were 
obseived  in  these  Tetraselmis  isolates.  First,  hydrogenase  activities 
accumulated  to  levels  that  are  an  order  of  magnitude  greater  than 
is  typically  observed  in  C.  reinhardtii  and  other  green  algae  when 
assayed  on  a  per  unit  chlorophyll  basis.  This  level  of  activity 
greatly  exceeds  the  reducing  equivalents  delivered  to  the  enzyme 
by  either  photosynthetic  or  fermentative  pathways;  suggesting  that 
gains  in  Ha  production  can  be  achieved  by  increasing  reductant  to 
the  hydrogenase  without  having  to  overexpress  the  enzyme. 
Second,  robust  H2  production  coupled  to  either  the  photosynthetic 
electron  transport  chain  or  fermentative  pathways  can  be  realized 
at  NaCI  concentrations  almost  two  times  that  of  sea  water; 
enabling  consideration  of  salt  water  resources  in  efforts  to 
efficiently  produce  biological  H2  from  algae.  Third,  although 
increasing  salt  levels  attenuate  in  vitro  enzyme  activity,  increases  in 
both  H2-photoproduction  and  dark,  fermentative  H2  production 
are  observed  as  a  function  of  increasing  NaCI  levels.  This 
demonstrates  that  salt  levels  can  positively  influence  H2  production 
in  algae.  Fourth,  hydrogenase  activity  is  robust  even  after  24  h  of 
dark  anoxia,  which  is  in  contrast  to  the  reduced  hydrogenase 
activities  reported  for  C.  reinhardtii  at  this  point  in  anoxic 
acclimation  [61].  Carbohydrate  analysis  indicated  that  well  over 
half  of  the  available  reducing  sugars  remain  within  the  cells  at  this 
point  of  anoxic  acclimation,  and  it  may  be  that  these  Tetraselmis 
strains  are  “hardwired”  to  withstand  extended  periods  of  anoxia. 
Fifth,  fermentative  H2  production  in  the  dark  is  almost  20-fold 
greater  on  a  per  unit  chlorophyll  basis  than  reported  for  C. 
reinhardtii,  suggesting  that  hydrogenase  may  be  more  effectively 
integrated  into  fermentative  pathways. 

In  addition  to  the  biotechnological  features  described,  we  are 
also  interested  in  (a)  characterizing  phototroph  metabolism  in  the 
dark  when  anoxia  occurs  in  aquatic  “dead-zones”  and  photo- 
trophs  rely  on  fermentation  for  ATP  production  and  (b) 
developing  a  more  informed  understanding  of  the  evolution  of 
FeFe-hydrogenases  in  eukaryotic  phototrophs  to  better  define  why 
hydrogenase  activity  is  observed  in  these  oxygenic  photosynthetic 
organisms.  Eukaryotic  phototrophs  with  hydrogenase  activity 
typically  exhibit  anoxic  heterofermentation  activity  in  the  dark, 
resulting  in  the  secretion  of  alcohols,  organic  acids  and  H2  [18], 
These  energy  carriers  and  organic  building  blocks  secreted  by  the 
algae  are  likely  used  by  the  consortia  of  heterotrophic  microor¬ 
ganisms  that  coexists  with  aquatic  phototrophs.  These  metabolites 
likely  influence  the  population  dynamics  of  the  aquatic  microbiota 
over  temporal  (diurnal,  seasonal  cycles)  and  spatial  (vertical 
depth)  gradients.  Such  features  warrant  further  investigation  of 


Table  2.  HYD  gene  transcript  lengths,  relative  expression  levels  and  rank  with  respect  to  the  cellular  transcriptome. 


Gene 

Number  of  homologs 

Transcript  length  (nt) 

Length  of  predicted 
protein  (AA) 

RPKMa 

Expression  rank  in 
transcriptome  assembly15 

HYDA 

1 

1,825 

461 

1461  ±32 

16 

HYDE 

1 

2,038 

492 

1 18+1.6 

655 

HYDEF 

1 

3,446 

997 

14±3.0 

7,088 

HYDG 

1 

2,033 

541 

1 1 3±0.6 

631 

Transcript  expression  level  in  reads  per  kilobase  per  million  mapped  reads.  Given  is  the  average  expression  and  range  for  two  biological  replicates. 

bRank  of  transcript  expression  when  compared  to  entire  transcriptome  assembly,  with  one  being  the  most  expressed  transcript  out  of  a  total  of  31,619  transcripts. 

doi:1 0.1 371 /journal. pone.0085812.t002 
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Table  3.  Single  nucleotide  polymorphisms  (SNPs)  in  HYD  transcripts. 


Genes 

SNPs  in  CDS 

AA  changes 

Avg.  Allele  1  Frequency3 

Avg.  Allele  2  Frequency3 

HYDA 

34 

6 

50.6±6.6 

47.6±5.9 

HYDE 

0 

0 

- 

- 

HYDE/EFb 

4 

2 

90.1  ±2.8 

8.3  ±1.8 

HYDG  1  1  49.7  48.6 


aAverage  frequency,  in  percentage,  of  ali  CDS  SNPs  of  given  allele  found  in  sequenced  transcriptome.  For  HYDE/EF  Allele  1  represents  HYDE  and  Allele  2  represents 
HYDEF. 

bHYDE/EF  indicates  the  comparison  of  the  HYDE  region  of  the  HYDE  and  HYDEF. 
doi:1 0.1 371/journal,  pone.008581 2.t003 


phototroph  fermentation  in  order  to  develop  an  understanding  of 
the  environmental  cues  influencing  metabolism,  catalog  the 
products  secreted  by  microbial  primary  producers,  and  charac¬ 
terize  the  effects  of  fermentative  product  secretion  on  microbial 
population  dynamics. 

Similar  to  C.  reinhardtii,  formate,  acetate,  and  ethanol  were 
secreted  into  culture  media  and  Ha  and  C02  were  evolved 
following  dark,  anoxic  acclimation.  Other  H2  producing  algae  can 
secrete  different  products  such  as  lactate  or  glycerol  after  anoxic 
acclimation  [50].  Although  different  organic  acids  and  alcohols 
can  be  secreted,  all  H2  producing  algae  characterized  to  date 
exhibit  a  form  of  heterofermentation,  which  is  also  observed  in  the 
species  of  Tetraselmis  described.  Muller  et  al.  recently  proposed 
that  a  base  set  of  anoxic  metabolisms  was  likely  present  in  the 
single  common  eukaryote  ancestor,  and  that  most  contemporary 
eukaryotic  anoxic  metabolisms  exhibit  at  least  a  subset  of  these 
original  pathways  [81].  Many  eukaryotes  retain  features  of  these 
core  enzymes,  with  hallmarks  including  enzymes  such  as  the  FeFe- 
hydrogenase,  pyruvate  ferredoxin  oxidoreductase,  pyruvate  for¬ 
mate  lyase,  lactate  dehydrogenase  and  alcohol  dehydrogenase, 

A  HYDA  allignment 


among  other  fermentative  enzymes  enabling  the  production  and 
secretion  of  FI2,  organic  acids  and  alcohols  [82].  Despite  greater 
than  a  billion  years  of  evolution  and  ample  opportunity  to  acquire 
different  modes  of  anoxic  metabolism  by  lateral  gene  transfer  from 
bacteria  and  archaea,  most  eukaiyotes  with  active  anoxic 
metabolisms  leverage  either  fermentation  or  relatively  simple 
anoxic  respiration  pathways.  This  hypothesis  is  consistent  with  our 
observation  that  Tetraselmis  readily  activates  heterofermentation 
pathways  during  anoxia  using  a  subset  of  the  ~50  enzymes  posited 
by  Muller  et  al.  to  be  present  in  the  eukaryote  common  ancestor 
[81].  The  Tetraselmis  GSL1  transcriptome  (see  below)  was 
attained  as  part  of  an  effort  to  obtain  transcriptomes  from 
hundreds  of  eukaryotic  algae  to  mine  this  information  for  unique 
gene  content  in  marine  algae.  When  completed,  this  project  will 
provide  a  powerful  dataset  for  use  in  determining  whether 
evidence  exists  that  eukaryotic  phototrophs  have  acquired  other 
anoxic  metabolisms  outside  of  the  core  proposed  by  Muller  et  al. 
by  lateral  gene  transfer,  which  does  not  appear  to  be  a  frequent 
occurrence. 


B  HYDAEFG  allignment 


Thermosipho  afncanus  TCF52B  (YP_00 233 5580) 

Thermotoga  maritima  MSB8JEHA58850) 

150198) 
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Chlorella  sir  NC64A-HydA1 
Chlorella  sir  NC64A-HydA2 
Tetraselmis  GSL1 
Chtorococcum  submannum 

Chlamydomonas  nocUgama  HydA3 
— Chlamydomonas  noctigama  HydA2 
Chlamydomonas  noctigama  HydAI 
— Chlamydomonas  moewusii  HvdA  ? 

Chlamydomonas  moewusii  HydA2  (AAT9043Q) 

ii - Chlamydomonas  reinhardtn  HydAI  (AAL23572) 

»00P - Chlamydomonas  reinhardtii  HydA2(AAL23573) 

-Votvox  car  ten  f  naganensis  HydA2  (EFJ5035Q) 
Vol/ox  carteo  f  naganensis  HydA  1  (EFJ50362) 

*c  I22J - Scenedesmus  oblquus  HydA2 

T_|‘-  Scenedesmus  oblquus  HydAI  (CAC34419) 

— Scenedesmus  vacuoiatus  (CAC83291) 

Haeglena  gruben  sir  NEG-M  (XP_002674266) 

100 1 — Piromyoes  sp  E2  (AAL 90459) 

' - Neocallimastix  frontalis 


stix  frontalis  (AAN39277) 
pseudonana  C CMP  1335 


89 


94 


-Trichomonas  vaginalis  G3 
-Naegleria  gruberi 

-Nannochloropis  gaditana  CCMP526 


-Tetraselmis  GSL1 
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™»-Volvox  carter i  f.  naqariensis 


naganensis 
-Bacteroides  thetaiotaomicron  VPI-5482 


Thalassiostra 

Blastocystrs  sp  Nandll  (ACD10930) 

■Trichomonas  vag  malts  G3  (XP_0015S0286) 

Trichomonas  vaginalis  G3  (XP_001 322682) 
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■Trichomonas  vi 
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Figure  6.  Maximum-likelihood  phylogenetic  reconstruction  of  individual  HYDA  sequences  (A)  and  a  concatentation  of  HYDAEFG 
(B).  Bootstrap  support  values  are  indicated  at  each  node.  The  root  in  the  HYDA  (NAR1,  paralogs  of  HYDA)  was  pruned  to  conserve  space.  Given  that 
NAR1  paralogs  of  HYDEFG  do  not  exist,  a  different  outgroup  was  chosen  (i.e  HYDAEFG  from  Bacteroides  thetaiotaomicron)  on  the  basis  that  the 
HYDA  that  it  encodes,  which  branches  basal  to  the  eukaryote  HYDA  lineage. 
doi:1 0.1 371  /journal. pone.008581 2.g006 
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Phylogenetic  analysis  of  algal  hydrogenases  sequenced  to  date 
show  that  these  enzymes  are  all  monophyletic,  indicative  of  a 
common  ancestor  and  that  distinct  lateral  gene  transfer  of 
hydrogenase  among  the  various  algal  lineages  has  not  occurred. 

To  determine  whether  this  is  also  the  case  for  Tetraselmis 
hydrogenases  and  to  determine  if  halophiles  acquired  distinct 
hydrogenase  enzymes  from  sources  that  are  unrelated  to 
freshwater  algae,  we  performed  in  depth  transcriptome  analyses 
(n  =  2)  after  4  h  of  anoxic  acclimation  using  Tetraselmis  GSL1. 
Two  surprising  features  emerged  from  these  analyses  that  have  not 
previously  been  described.  First,  the  anoxic  transcriptome  of 
Tetraselmis  GSL1  suggests  a  single  FeFe-hydrogenase  with  only 
the  Pi-cluster  binding  domain.  Without  whole  genome  sequencing, 
it  is  impossible  to  rule  out  the  possibility  that  other  FIYDA 
homologs  are  present  in  the  Tetraselmis  GSL1  genome.  However, 
if  other  HYDA  homologs  are  present,  their  expression  is 
sufficiently  low  after  4  h  of  anoxia  that  we  could  not  detect 
transcripts  in  our  dataset.  This  is  not  consistent  with  other  green 
algae,  such  as  Chlamydomonas,  in  which  both  HYDA1  and 
HYDA2  transcripts  increase  after  exposure  to  anoxia  and  have 
been  detected  4  h  after  anaerobic  induction  [63].  The  depth  of 
coverage  obtained  by  transcriptome  sequencing  was  high  enough 
to  capture  HYDA  precursor  niRNA  that  contained  unspliced 
introns  (data  not  shown),  so  it  seems  unlikely  that  an  alternative 
HYDA  homolog  is  being  expressed  at  substantive  levels  and  not 
being  detected.  It  seems  likely  that  Tetraselmis  encodes  only  a 
single  FeFe-hydrogenase,  and  if  so,  it  is  the  first  green  alga 
described  that  lacks  a  secondary  duplication  of  this  gene.  As 
several  SNPs  were  found  in  Tetraselmis  GSL1  HYDA,  we 
analyzed  the  SNP  frequency.  Our  results  indicate  these  34 
nucleotide  differences  are  a  result  of  multiploidy,  were  localized 
to  the  putative  transit  peptide,  and  resulted  in  only  6  AA  changes 
(98.7%  identical  AA).  Chlamydomonas  reinhardtii  is  haploid  and 
the  HYDA2  isoform  in  this  alga  shares  only  68%  AA  identity  to 
HYDA1  [24],  Moreover,  the  promoter  regions  of  HYDA1  and 
HYDA2  are  unique  and  lack  significant  regions  of  sequence 
homology,  which  may  suggest  differences  in  the  regulation  of  gene 
expression,  which  has  been  experimentally  verified  [83].  This 
phenomenon  is  not  likely  to  be  the  case  for  the  two  Tetraselmis 
GSL1  HYDA  transcripts  that  share  very  high  similarity  (99.5% 
nucleotide  identity  in  the  CDS),  even  higher  in  the  promoter 
region.  Moreover,  the  averaged  abundance  is  very  similar  for  both 
HYDA  alleles  (Table  2),  suggesting  no  differential  expression  for 
the  two  allelic  transcripts  and  consistent  with  a  diploid  organism. 

Second,  the  Tetraselmis  GSL1  transcriptome  encodes  two 
homologs  to  HYDE,  one  of  which  exists  as  a  single  transcript  and 
the  other  of  which  exists  as  a  HYDEF  fusion.  This  represents  the 
first  report  of  the  coexistence  of  2  HYDE  homologs  in  an  algal 
genome,  only  one  of  which  is  fused  inframe  to  HYDF.  A  single 
HYDEF  fusion  is  observed  in  all  other  green  algae  with 
hydrogenase  activity  sequenced  to  date  [25].  We  speculate  that 
the  presence  of  two  HYDE  homologs  in  Tetraselmis  GSL1 
represents  a  key  evolutionary  intermediate  pointing  to  the  fusion 
event  that  led  to  the  emergence  of  HYDEF,  which  may  have 
occurred  within  the  algal  lineages  as  a  way  to  economize  protein 
translocation  to  the  plastid.  HYDE  was  putatively  incorporated 
into  algal  genomes  as  a  single  gene  early  in  the  evolution  of  algae 
and  can  still  be  found  as  an  independent  form  in  Nannochloropsis 
where  HYDE  and  HYDF  remain  separate  despite  being  spatially 
located  next  to  each  other  in  a  genomic  hydrogenase  gene  cluster 
[71].  Analysis  of  the  AA  linker  region  between  the  HYDE  and 
HYDF  regions  of  fused  HYDEF  reveals  no  conserved  motifs 
between  currently  sequenced  algae  (data  not  shown).  If  the  fusion 
event  occurred  multiple  times  in  various  algal  lineages  this  may 

PLOS  ONE  |  www.plosone.org  1 1 


account  for  the  lack  of  any  conserved  HYDEF  linker  region 
because  it  was  not  present  in  the  last  common  algal  ancestor.  Why 
Tetraselmis  GSL1  retained  an  independent  copy  of  HYDE  is 
unknown;  however,  as  shown  in  Table  2,  the  transcript 
abundances  of  HYDE  and  HYDEF  are  dramatically  different 
(Table  2).  This  may  be  directly  correlated  to  the  function  of 
HYDE  as  the  putative  source  for  the  dithiolate  ligand  at  the  FeFe- 
hydrogenase  active  site  [84],  [85];  whereas,  HYDF  may  function 
in  a  catalytic  role,  consistent  with  its  role  as  a  scaffold  protein  for 
active  site  construction  and  insertion  [86] .  It  is  unclear  why  HYDE 
would  be  fused  to  HYDF  in  this  scenario. 

Intriguing  aspects  of  phylogenetically  linked  enzymatic  adapta¬ 
tions  are  emerging  among  the  various  algal  linkages  and  the 
sequences  reported  here  provide  additional  evidence  regarding  the 
trajectory  of  algal  hydrogenase  acquisition  and  evolution.  To  date, 
eukaryotic  algae  have  only  been  demonstrated  to  encode  FeFe- 
hydrogenases  [1]  which  is  further  supported  by  our  transcriptome 
analysis.  Inconclusive  studies  indicating  the  presence  of  NiFe- 
hydrogenases  have  been  reported  [87],  [36],  Although  the  Bhosale 
et  al.  report  contends  that  a  NiFe-hydrogenase  is  present  in  a 
strain  of  Tetraselmis,  we  find  no  support  of  a  NiFe-hydrogenase  in 
Tetraselmis  GSL1  and  only  find  evidence  of  FeFe-hydrogenase. 
Very  recently,  a  partial  HYDA  CDS  from  Tetraselmis  subcordi- 
formis  was  submitted  to  GenBank  (Accession  number:  JQ3 17304), 
which  has  85%  similarity  and  50%  coverage  with  respect  to  the 
HYDA  AA  sequence  reported  here. 

Among  the  Chlorophyceae,  all  FeFe-hydrogenases  encode  only 
the  H-cluster  binding  domain  and  the  presence  of  a  hydrogenase 
with  F-cluster  binding  domains  has  yet  to  be  reported  in  this  class. 
To  date,  these  truncated  H-cluster  only  hydrogenases  are 
exclusively  restricted  to  the  Chlorophyceaen  algae.  It  is  unclear 
why  these  truncated  variants  emerged  in  evolution,  but  the 
presence  of  an  H-cluster  only  enzyme  in  a  halophilic  specie  of 
Tetraselmis  (Chlorophyceae)  is  consistent  with  the  presence  of  H- 
cluster  only  enzymes  early  in  the  evolution  of  the  Chlorophyceae, 
and  indicates  that  both  fresh  water  and  salt  water  members 
metabolically  incorporate  the  H-cluster  only  enzyme  variant. 
Currently,  the  most  robust  H2-photoproduction  activity  is 
obseived  in  the  Chlorophyceae  and  truncation  may  enable  more 
efficient  coupling  to  the  photosynthetic  electron  transport  chain 
[25].  There  are  now  full-length  algal  FeFe-hydrogenase  sequences 
for  two  algae  outside  of  the  Chlorophyceae  [25],  [71]  and 
interestingly  these  organisms  both  encode  enzymes  with  H-cluster 
and  F-cluster  domains  and  exhibit  comparatively  lower  H2- 
photoproduction  rates.  The  Trebouxiophyte,  Chlorella  variabilis 
NC64A  contains  two  F-cluster  FeFe-hydrogenases  [25],  whereas 
marine  Nannochloropsis  strains  encode  only  a  single  F-cluster 
containing  FeFe-hydrogenase  [69]. 

Phylogenetic  analyses  of  algal  FeFe-hydrogenases  demonstrate 
that  the  freshwater  strains  with  sequenced  transcriptomes/ 
genomes  contain  at  least  two  hydrogenase  isoforms  that  are  most 
similar  to  each  other  with  respect  to  the  other  algal  hydrogenases, 
indicating  gene  duplication  within  each  of  these  individual  algal 
lineages  [25].  The  marine  organisms  with  available  transcrip- 
tomes/genomes  (Tetraselmis  and  Nannochloropsis  [71],  [88]) 
encode  only  a  single  HYDA  isoform  and  both  encode  a  copy  of 
HYDE  that  is  not  fused  to  HYDF.  Such  observations  may  imply 
that  marine  organisms  were  not  subject  to  the  same  evolutionary 
pressures  that  resulted  in  the  hydrogenase  duplication  events 
obseived  in  fresh  water  strains.  The  single  HYDE  protein  is  more 
difficult  to  interpret  as  Tetraselmis  GSL1  also  has  HYDEF. 

In  sunt,  strains  of  Tetraselmis  are  promising  candidates  for  the 
efficient  production  of  H2  in  saline  systems  and  salt  can  be 
leveraged  as  a  means  to  positively  influence  H2  production.  The 
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results  of  HYDA  transcriptome  sequencing  and  phylogenetic 
analyses  are  consistent  with  speculation  that  FeFe-hydrogenases 
emerged  in  algal  genomes  only  a  single  time,  perhaps  as  far  back 
as  the  primary  endosymbiotic  event,  and  that  only  a  single  FeFe- 
hydrogenase  with  H-cluster  and  F-cluster  domains  was  present 
[51],  [25].  The  FeFe-hydrogenase  was  likely  a  component  of  a 
suite  of  fermentation  enzymes  present  in  an  ancestral  eukaryote 
that  functioned  in  adaptation  to  anoxia  and  was  subsequently 
evolved  to  couple  to  the  photosynthetic  electron  transport  chain 
under  some  conditions.  During  evolution,  the  Chlorophyceae 
selected  a  truncated  H-cluster  only  enzyme,  and  the  green  algae 
generated  a  HYDEF  fusion  by  translocating  the  5'-UTR  of  a 
single  HYDF  into  the  end  of  HYDE  S'-coding  sequence  to 
generate  a  new  intron  with  in  frame  splicing  to  generate  a  HYDEF 
fusion.  Freshwater  algae  then  experienced  selective  pressures  to 
duplicate  the  HYDA  sequence,  although  for  unknown  reasons. 
Although  additional  research  is  required  to  better  define  and 
improve  Ha  production  from  halophilic  species  of  Tetraselmis  and 
new  genome  information  is  required  to  attain  more  definitive 
insights  into  the  origins  and  evolution  of  hydrogenase  in  algae,  the 
strains  of  Tetraselmis  described  here  help  to  fill  existing  knowledge 
gaps  in  both  of  these  areas. 

Supporting  Information 

Figure  SI  Schematic  representation  of  the  independent 
HYDE  and  fused  HYDEF  transcripts  obtained  from 
transcriptome  sequencing.  The  percent  nucleotide  identity  of 
each  transcript  to  each  other  is  given.  Detected  SNPs  between 
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ABSTRACT:  The  [FeFe]-hydrogenase  catalytic  site  FI  cluster  is  a  complex  iron  sulfur  cluster  assembly  that  is  sensitive  to 
oxygen  (02).  The  02  sensitivity  is  a  significant  barrier  for  production  of  hydrogen  as  an  energy  source  in  water -splitting, 
oxygenic  systems.  Oxygen  reacts  directly  with  the  FI  cluster,  which  results  in  rapid  enzyme  inactivation  and  eventual 
degradation.  To  investigate  the  progression  of  02-dependent  [FeFe]-hydrogenase  inactivation  and  the  process  of  FI 
cluster  degradation,  the  highly  02  sensitive  [FeFe]-hydrogenase  FlydAi  from  the  green  algae  Chlamydomonas  reinhardtii 
was  exposed  to  defined  concentrations  of  02  while  monitoring  the  loss  of  activity  and  accompanying  changes  in  FI  cluster 
spectroscopic  properties.  The  results  indicate  that  H  cluster  degradation  proceeds  through  a  series  of  reactions,  the  rate 
and  extent  of  which  depend  on  the  initial  enzyme  reduction/oxidation  state.  The  degradation  process  begins  with  02 
binding  and  reacting  with  the  2Fe  subcluster,  leading  to  degradation  of  the  2Fe  subcluster  and  leaving  an  inactive  [qFe- 
4S]  subcluster  state.  In  addition,  Cys  169  that  has  been  implicated  as  part  of  the  proton  transfer  pathway  was  observed  to 
have  reacted  with  a  reactive  oxygen  species  to  form  sulfenic  acid.  This  final  inactive  degradation  product  could  be 
reactivated  in  vitro  by  incubation  with  2Fe  subcluster  maturation  machinery,  specifically  HydFEG,  which  was  observed  by 
recovery  of  enzyme  activity. 


■  INTRODUCTION 

[FeFe]-hydrogenases  are  found  in  bacteria  and  lower 
eukaryotes,  and  are  commonly  involved  in  the  recycling 
of  reduced  electron  carriers  that  accumulate  during 
anaerobic  metabolism.1  [FeFe]-hydrogenases  catalyze 
reversible  Fl2  activation  at  very  high  rates  and  thus  are 
attractive  targets  for  bioengineering  efforts  aimed  at 
coupling  microbial  H2  production  to  oxygenic 
photosynthesis2,3  However,  [FeFe]-hydrogenases  are 
rapidly  inactivated  upon  exposure  to  02,  the  byproduct  of 
water  oxidation.4  In  [FeFe]-hydrogenases,  proton 
reduction  occurs  at  a  complex  bridged  FeS  cluster  termed 
the  H  cluster.  The  H  cluster  exists  as  a  regular  [4Fe-4S] 
subcluster  bridged  to  an  organometallic,  2Fe  subcluster 
through  a  protein  cysteine  thiolate.  The  2Fe  subcluster  is 
coordinated  by  unique  non-protein  ligands  including  CO, 
CN,  and  dithiomethylamine  (Figure  i).5"9  The  02 
sensitivity  has  been  attributed  to  redox  reactions  with  02, 
and  subsequent  destruction  of  the  H  cluster  by  reactive 


Figure  1.  H  cluster  ball-and-stick  representation  with 
carbon,  nitrogen,  oxygen,  sulfur,  and  iron  atoms  colored 
grey,  blue,  red,  yellow,  and  rust  respectively  (PDB  3C8Y). 
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oxygen  species  (ROS),  rendering  the  enzyme  irreversibly 
inactivated.10 

Although  all  of  the  characterized  [FeFe]-hydrogenases 
have  been  shown  to  be  sensitive  to  02  inactivation.  The 
enzymes  from  different  sources  have  varying  sensitivity  to 
02,  which  has  been  attributed  largely  to  differences  in  the 
access  of  the  active  sites  to  02.4,11"15  Computational  studies 
have  revealed  putative  channels  that  are  proposed  to 
function  in  the  diffusion  of  gases,  including  02,  to  and 
from  the  active  site.11  It  was  also  discovered  through 
computational  analysis  and  enzymatic  assays  of  site- 
directed  variants  that  constricting  the  gas  channels  can 
decrease  the  sensitivity  of  the  [FeFe]-hydrogenase  to  02 
but  not  to  a  significant  enough  degree  to  be  useful  for 
applications  in  technology.11’13"15  A  random  mutagenesis 
approach  was  used  to  screen  thousands  of  [FeFe]- 
hydrogenase  variants  leading  to  isolation  of  a  more  02 
tolerant  variant  with  several  site  substitutions.  The  effect 
of  each  mutation  was  individually  tested,  and  showed  that 
each  contributed  to  an  additive  effect  on  the  overall  02 
tolerance  of  the  [FeFe]-hydrogenase.15 

Theoretical  studies  of  inactivation  have  measured  the 
thermodynamics  of  O,  binding  and  reactivity  with  the  FI 
cluster,  and  probed  the  subsequent  reaction  steps  that 
ultimately  lead  to  enzyme  degradation.  The  models 
suggest  that  initiation  of  02  interaction  with  hydrogenase 
catalytic  is  dependent  on  the  oxidation  state, 
coordination  environment  and  redox  activity  of  the  Fe 
sites.16"20  In  the  FI  cluster  reaction  models,  02  binds  at  the 
distal  Fe,  which  is  assigned  to  the  (I)  oxidation  state  for 
Hox  with  a  loosely  bound  water.  Reaction  of  the  bound  02 
with  Fe(I)  leads  to  formation  of  a  terminally  bound  -OOFI 
or  -O,"  species.  Further  reaction  cycles  of  the  terminal 
“O”  species  with  Fl+  generates  H202  or  iron-peroxide  as 
end-products.19  Alternatively,  2Fe  subcluster  oxidation,  or 
O  atom  insertion  into  the  distal  Fe  terminal  CO  ligand 
can  release  CO,.19  The  exact  chemical  nature  of  the  O- 
species  produced  from  02  reactions  at  the  H  cluster 
remain  to  be  experimentally  validated. 

Structural  and  biophysical  studies  on  how  02  reacts 
with  the  H  cluster  to  cause  inactivation  are  challenging, 
and  there  are  limited  experimental  studies  on  this 
process.21’22  A  mechanism  of  FI  cluster  degradation  by  02 
has  been  proposed  from  the  results  of  X-ray  absorption 
spectroscopy  (XAS)  studies  in  which  reactive  02  species 
produced  at  the  2Fe  subcluster  result  in  enzyme 
inactivation  by  destruction  of  the  [qFe-qS]  subcluster.10’23 
Those  results  contrast  with  previous  models  whereby  02 
accesses  and  binds  to  the  2Fe  subcluster  to  form  ROS  or 
other  end-products  that  can  readily  degrade  the  2Fe 
subcluster  followed  by  loss  of  enzyme  activity.19  Thus,  in 
order  to  reexamine  the  mechanistic  process  of  02 
inactivation  of  [FeFe]-hydrogenases,  we  have  exposed  the 
[FeFe] -hydrogenase  from  Chlamydomonas  reinhardtii 
(CrHydAi)  to  low  titers  of  02  while  monitoring  changes  in 
the  biochemical  activity  and  changes  in  Fourier  transform 
infrared  (FTIR)  and  Ultraviolet-visible  (UV-Vis) 
spectroscopic  properties.  The  FTIR  results  indicate  that 
H  cluster  reacts  with  02  and  undergoes  a  series  of 
reactions  to  produce  a  mixture  of  intermediates,  the 


populations  of  which  depend  on  the  initial 
reduction/oxidation  state  of  the  FI  cluster.  Ultimately, 
oxidative  destruction  results  in  the  loss  of  the  2Fe 
subcluster  and  formation  of  a  [qFe-qS]  subcluster  state 
observed  with  UV-Vis  and  in  the  X-ray  crystal  structure. 
This  inactivation/degradation  end  product  could  be 
reactivated  in  vitro  by  incubation  with  the  2Fe  subcluster 
specific  maturation  machinery,  as  observed  by  enzyme 
activity  assays. 

■  EXPERIMENTAL  SECTION 

Protein  Preparation.  Fleterologous  expression  and 
purification  of  CrHydAi  in  Escherichia  coli  was  done  as 
previously  described.24,25  The  protein  was  purified  under 
anaerobic  conditions  in  a  MBraun  anaerobic  chamber 
(MBraun  USA).  All  buffers  were  degassed  under  vacuum 
before  purification.  CrHydAi  isolation  from  cell  extracts 
was  performed  using  a  two-step  chromatography  process 
of  ion-exchange  over  diethylaminoethanol  (DEAE,  GE 
Lifesciences)  Sepharose  followed  by  affinity  capture  on 
Strep-Tactin  (IBA)  resin.25  Strep-Tactin  bound  enzyme 
was  eluted  in  50  mM  Tris  buffer  (pH  8.0)  containing  300 
mM  NaCl,  5%  glycerol,  5  mM  sodium  dithionite  (NaDT), 
and  2  mM  desthiobiotin.  Purity  was  verified  by  sodium 
dodecyl  sulfate  polyacrylamide  gel  electrophoresis  (SDS- 
PAGE),  and  the  concentration  determined  by  Bradford 
assay.  CrHydAi  was  prepared  in  the  Hox  state  by  serial 
concentration  and  dilution  in  NaDT-free  buffer  until  the 
FTIR  spectra  consisted  primarily  of  Hox.  CrHydAi  was 
prepared  in  the  Hox-CO  state  by  brief  sparging  under 
100%  CO  gas,  and  10  min  incubation,  in  a  sealed  serum 
vial.  Aliquots  of  CrHydAi  were  prepared  in  Hred-H2  by  10 
evacuation  flush  cycles  with  100%  H2  on  a  Schlenk  line 
fitted  with  an  02  trap. 

HydF  was  obtained  by  expression  of  hydF  in  the 
background  of  HydE  and  HydG  (HydFEG)  in  E.  coli  strain 
BL21  (DE3).  The  hydE,  hydF,  and  hydG  from  Clostridium 
acetobutylicum  were  individually  cloned  into  pET-Duet, 
pRSF-Duet,  and  pCDF-Duet,  respectively.26  The  cloned 
copy  of  hydF  contained  a  N-terminal  6xHis  tag.26  Cells 
were  grown  in  LB  Miller  growth  medium  supplemented 
with  streptomycin  (50  mg  L  '),  kanamycin  (30  mg  L  '), 
ampicillin  (100  mg  L  '),  0.5%  w/v  glucose  (-25  mM),  2  mM 
ferric  ammonium  citrate  and  50  mM  phosphate  buffer 
(final  pH  of  medium  was  7.4).  All  cultures  were  grown 
aerobically  at  25  °C  until  an  OD6oo  of  0.5.  Cultures  were 
sparged  with  100%  argon  for  20  min,  and  induced  with  1.5 
mM  isopropyl  (3-D-i-thiogalactopyranoside  (IPTG). 
Cysteine  (2  mM)  and  sodium  fumarate  (25  mM)  were 
added  immediately  after  IPTG  addition.  Cultures  were 
sparged  with  argon  at  25°C  overnight. 

HydFEG  was  purified  in  an  anaerobic  chamber  (Coy 
Labs,  Grass  Lake  MI)  as  previously  described.26  Cells  were 
harvested  by  centrifugation  and  cell  pellets  stored  at  -80 
°C.  Cells  were  lysed  by  resuspension  in  buffer  composed 
of  10  mM  HEPES,  pH  7.4,  0.5  M  KC1,  5%  glycerol,  1  mM 
dithiothreitol,  20  mM  imidazole,  20  mM  MgCl2,  1  mM 
PMSF,  1%  Triton  X-100,  140  pg  ml"1  DNAse  and  RNAse, 
and  120  pg  ml"1  lysozyme.  The  cell  lysate  was  stirred  for 
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i  h  at  room  temp,  and  centrifuged  in  gas  tight  bottles  at 
38,000  x  g  for  30  min.  The  His-HydFEG  was  purified  from 
the  supernatant  by  immobilized  metal  chromatography 
on  a  TALON  cobalt  column  (GE  Life  Sciences).  The 
column  was  loaded  and  washed  with  15  column  volumes 
of  20  mM  HEPES  pH  7.4,  0.5  M  KC1,  5%  glycerol,  1  mM 
dithiothreitol,  and  20  mM  imidazole.  Purified  HydFEG  was 
eluted  with  wash  buffer  containing  200  mM  imidazole. 
HydFEG  was  collected  and  concentrated  anaerobically 
with  30  kDa  Amicon  Ultra-15  centrifugal  concentrators 
(Millipore).  The  HydFEG  was  loaded  onto  a  G25  PD-10 
desalting  column  (GE  Life  Sciences)  to  remove  imidazole 
and  eluted  with  50  mM  HEPES  pH  7.4,  0.5  M  KC1,  5% 
glycerol,  and  1  mM  dithiothreitol.  HydFEG  was  flash 
frozen  in  liquid  N2  and  stored  at  -80  °C  or  in  liquid  N2 
until  further  use. 

Preparation  of  Oxygen  Treated  CrHydAi.  O,  gas 

standards  were  prepared  using  septum-sealed  123  ml 
Wheaton  vials.  Dilutions  were  prepared  using  a  gas-tight 
syringe  (Hamilton)  to  remove  gas  from  a  100%  02 
standard  (equilibrated  to  atmospheric  pressure)  and 
injecting  the  02  into  vials  containing  100%  N2  at 
atmospheric  pressure.  Standards  of  100%,  4%,  1%,  and 
0.1%  02  were  used  for  all  of  the  02  additions.  Samples 
used  to  measure  the  FTIR  spectra  of  02  treated  CrHydAi 
were  prepared  as  follows:  HydAi  at  30-70  mg  ml"1  was 
pipetted  into  a  septum-sealed  825  pi  conical  vial  on  ice, 
and  02  (final  partial  pressure  concentrations  of  0.01- 
24.2%)  was  added  by  gas  tight  syringe  with  magnetic 
stirring.  A  5.5  pi  aliquot  of  CrHydAi  was  removed  by 
syringe  and  the  FTIR  spectra  were  collected  in  a  custom 
gas-tight  FTIR  sample  cell.27  For  the  02  reaction 
characterization  by  UV-Vis  spectroscopy,  02was  injected 
in  a  low  volume  sealed  quartz  cuvette  and  the  reaction 
was  scanned  once  per-min,  for  200-300  min. 

FTIR  Spectroscopy.  Spectra  were  collected  as 
described  previously  with  a  Nicolet  6700  FTIR 
spectrometer  (Thermo  Fisher  Scientific)  equipped  with  a 
Globar  IR  source,  a  CaF2  beam  splitter,  and  a  liquid- 
nitrogen-cooled  mercury  cadmium  telluride  (MCT) 
detector.27  All  of  the  spectra  were  collected  at  room 
temperature  (21  °C).  The  OMNIC  software  was 
configured  to  report  absorbance  spectra,  and  absorbance 
baselines  were  fit  to  these  data  using  a  manually  adjusted 
spline. 

Ultraviolet -Visible  Spectroscopy.  A  100  pi  aliquot  of 
Hox  CrHydAi  was  prepared  in  the  glove  box  under  100% 
N2,  transferred  into  low  volume  1  cm  path-length  quartz 
cuvette  and  septum  sealed.  The  02  reaction  was  initiated 
by  injecting  02  with  a  gas-tight  syringe  into  the  cuvette 
with  the  enzyme  solution  under  magnetic  stirring  at  4  C 
and  cooled  with  a  peltier  cooler.  The  reaction  of  CrHydAi 
with  02  was  monitored  by  UV-Vis  every  min  by  scanning 
over  a  250-750  nm  range  at  1  nm  intervals.  UV-Vis  spectra 
were  recorded  on  a  Cary  4000  UV-Visible 
spectrophotometer . 

Activity  Assays.  Activities  of  CrHydAi  were  assayed  by 
H2  evolution  from  reduced  methyl  viologen.  Reaction 
volumes  of  0.6  or  2  ml  were  placed  in  3  or  10  ml  Wheaton 
vials,  respectively,  which  contained  5  mM  methyl 


viologen  (MV),  10  mM  NaDT,  50  mM  Tris,  300  mM  NaCl, 
5%  glycerol,  and  between  25  ng-4  pg  of  enzyme  per  assay. 
Hydrogen  production  was  detected  by  gas 
chromatography  (Agilent  Technologies). 

Mass  Spectrometry.  Protein  digests  were  performed 
with  1.5  mg  ml"1  CrHydAi,  12.5  ug  ml"1  Trypsin  Gold 
(Promega),  50  mM  Tris-HCl  pH  8,  300  mM  NaCl,  and  5% 
glycerol.  Reactions  were  allowed  to  proceed  overnight  in 
a  37  °C  heat  block,  and  complete  digestion  was  verified  by 
sodium  dodecyl  sulfate  polyacrylamide  gel 
electrophoresis  (SDS-PAGE).  Reactions  were  transferred 
into  septum  sealed  vials  with  a  N2  headspace.  The 
digestion  product  was  diluted  up  to  a  1000  fold  in  50/50 
water/Acetonitrile  and  transferred  to  screw  capped  auto 
sampler  vials  for  LCMS. 

An  Agilent  1100  series  HPLC  system  coupled  to  an 
Agilent  Chip  Cube  integrated  microfluidics  reverse-phase 
nano-HPLC  system  was  used.  Trapping  and  analytical 
separations  were  performed  with  an  Agilent  C18  HPLC- 
Chip  (G4240-62001,  40  nl  trap  column  and  75  pm  x  43  mm 
analytical).  Chromatography  solvents  were  H20  with 
0.1%  (v/v)  formic  acid  for  channel  "A"  and  acetonitrile  for 
channel  "B".  The  HPLC  program  was  held  at  7%  B  from 
0.0  to  2.0  min,  then  ramped  from  7  to  35%  B  from  2.0  to 
20.0  min.  The  gradient  was  then  ramped  from  35  to  95% 
B  from  22  to  27  min.  The  mass  spectrometer  was  an 
Agilent  6520  Q-TOF  with  a  dual-ESI  source:  resolution 
approximately  20,000  and  accuracy  3  ppm.  Spectra  were 
collected  in  positive  mode  from  50  to  1700  m/z  at  2  Hz  for 
both  MS  and  MS/MS,  with  adaptive  acquisition  time  for 
highly-abundant  ions. 

The  resulting  MS/MS  data  were  analyzed  using  the 
PEAKS  6.0  software  package  and  searching  the  CrHydAi 
protein  sequence.  Peptide  mass  tolerance  was  10  ppm 
and  fragment  mass  tolerance  0.5  Da.  Variable 
modifications  were  set  to  sulfonic  acid  47.97,  sulfinic  acid 
31.98,  and  oxidation  or  hydroxylation  of  cysteine  15.99. 

Crystallization  and  Structure  Determination. 
CrHydAi  was  crystallized  anaerobically  in  a  MBraun  glove 
box  at  room  temperature  using  micro-capillary  batch 
diffusion  with  a  precipitation  solution  of  25.5% 
polyethylene  glycol  8000  as  precipitate  and  0.085  M 
sodium  cacodylate  (pH  6.5),  0.17  M  sodium  acetate 
trihydrate,  and  1  mM  dithionite.  After  allowing  the 
crystals  to  form  for  2-3  weeks,  they  were  mounted  on 
cryo-loops  and  flash  frozen  in  liquid  nitrogen.  Data  were 
collected  at  the  Stanford  Synchrotron  Radiation 
Lightsource  (SSRL)  on  beam  line  BL12-1  at  1.75  A 
wavelength.  The  data  was  processed  using  XDS,28  and 
scaled  with  Pointless  and  Aimless.29  The  structure  was 
solved  using  molecular  replacement  using  AutoMR  (CCP4 
suite  of  programs)  with  CrHydAi  (PDB  ID,  3LX4).30  The 
structure  was  built  using  COOT  with  further  refinement 
using  REFMAC5  using  NCS  and  B  factor  restraints.  The 
final  model  was  solved  to  2.23  A  with  an  R  factor  of  20.7% 
and  an  R  free  of  25.2%  (Table  Si).  Atomic  coordinates  were 
deposited  in  the  PDB  (code  4ROV). 
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Figure  2.  FTIR  spectra  of  CrHydAi  samples  exposed  to  02. 
The  solid  red  spectra  are  prior  to  02  injection,  and  black 
traces  after  2  h  exposure  to  02.  (a)  Hox,  ~o.oi%  02;  (b)  Hox- 
CO,  -2.4%  02;  (c)  Hred-H2,  -0.01%  02;  (d)  Hred-NaDT,  -0.01% 
02.  Wavenumbers  in  red,  Hox;  black,  Hox-CO;  green,  Hred- 
H2;  magenta  Hred-NaDT;  purple,  oxidative  damage  and  cyan, 
unassigned. 

In  vitro  Reactivation  of  02  Inactivated  CrHydAi. 

Reactivation  of  the  02  inactivated  CrHydAi  was 
performed  with  the  addition  of  N-terminal  His  tagged- 
HydFEC  at  different  molar  ratios  to  obtain  a  300  pi  total 
volume  in  3  ml  crimp  sealed  anaerobic  Wheaton  vials. 
CrHydAi  was  allowed  to  reactivate  at  37  °C  in  a  water 
bath  for  1  h,  and  reactivation  was  followed  by  measuring 
the  H2  production  activity  as  described  above. 


■  RESULTS 

FTIR  Analysis  and  Comparison  of  Hox,  Hred-NaDT, 

Hred-H2  and  Hox-CO.  To  investigate  how  differences  in 
the  oxidation  state  and/or  site  occupancy  of  the  2Fe 
subduster  of  CrHydAi  affect  the  process  of  02  damage, 
reducing  agents  NaDT  and  H2,  and  the  competitive 
inhibitor  CO  were  each  added  to  oxidized  (Hox)  CrHydAi. 
These  enzyme  samples  were  then  exposed  to  02  and 
monitored  by  FTIR.  CrHydAi  was  initially  prepared  in  the 
Hox  state  with  major  peaks  observed  at  1804,  1940,  1964, 
2071  and  2089  cm"1  (Figure  2a,  red).  Exposure  of  the  Hox 
CrHydAi  to  ~o.oi%  02  (Figure  2a  black  spectrum)  for  2  h 
resulted  in  the  loss  of  Hox  features  and  an  increase  in 
features  assigned  to  Hox-CO  and  the  02-damaged  cluster, 
discussed  in  more  detail  in  the  time  course  discussion. 

CrHydAi  in  the  Hox  state  was  exposed  to  CO  to  make 
Hox-CO  (Figure  2b,  red  spectrum).  The  expected  shift  of 
vCO  and  vCN  peaks  to  higher  wavenumbers  were 
observed  as  previously  reported  for  CO-inhibited  [FeFe]- 
hydrogenases,  with  vCO  peaks  at  1809,  1963,  1969,  2013, 
and  vCN  peaks  at  2082,  and  2091  cm"1.31"33  The  enzyme 
was  then  exposed  to  -2.4%  02.  Hox-CO  was  exposed  to 
larger  percentages  of  oxygen  in  order  to  see  if  any 
attenuation  of  FTIR  signal  could  be  observed.  After  2  h  of 
02  exposure,  the  FTIR  spectrum  indicated  the  peaks 
assigned  to  Hox-CO  remained  relatively  unchanged 
(Figure  2b,  red  vs.  black  spectrum).  Small  peaks 
appeared,  which  are  attributed  to  degradation  of  some 
residual  Hox  species,  and  this  may  have  also  contributed 
to  the  slight  increase  in  Hox-CO  signal  intensity  after  02 
exposure.  Enzyme  in  the  Hox-CO  state  was  also  treated 
with  higher  amounts  of  02  (up  to  -24.2%,  Figure  S2),  and 
the  resulting  FTIR  spectra  again  showed  significantly  less 
degradation  compared  to  oxidized  and  reduced 
preparations,  consistent  with  other  observations  that  Hox- 
CO  is  stable  to  oxidative  reactions  with  02.4,10,34 

Equilibration  of  Hox  CrHydAi  under  100%  H2  led  to  a 
collective  shift  in  the  vCO  peaks,  consistent  with  ligand 
exchange  and  electronic  transitions  at  the  H  cluster 
associated  with  H2  activation.27,35  Figure  2c  shows  the 
spectrum  of  H2-reduced  CrHydAi  with  principle  vCO 
peaks,  assigned  to  a  mixed  population  of  reduced 
intermediates,  observed  at  1792, 1882, 1891, 1916, 1933, 1953, 
and  1963  cm'1  (Figure  2c,  red  spectrum).27'35'36  Additional 
peaks  associated  with  Hox-CO  were  also  seen  in  the 
starting  sample,  possibly  arising  from  a  slight  amount  of 
02  exposure  during  H2  treatment.  Exposure  of  Hred-H2 
CrHydAi  to  ~o.oi%  02  for  2  h,  in  the  presence  of  100%  H2 
atmosphere  led  to  attenuation  of  the  1792, 1882, 1891, 1916, 
and  1933  cm"1  peaks  (Figure  2c,  red  vs.  black  spectrum). 
This  was  accompanied  by  an  increase  in  peak  intensities 
assigned  to  Hoxat  1804, 1940,  and  1964  cm"1  together  with  a 
small  increase  in  peak  intensities  assigned  to  Hox-CO  at 
1969  and  2013  cm"1.  Thus,  degradation  of  the  H2-reduced 
CrHydAi  is  likely  to  involve  initial  formation  of  HOXi  with 
subsequent  interaction  with  02  leading  to  formation  of 
Hox-CO  and  eventual  2Fe  subcluster  degradation. 

When  Hox  CrHydAi  was  treated  with  NaDT  (20  mM 
final  concentration,  a  20-fold  excess),  the  initial  spectrum 
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(Figure  2d,  red  spectrum)  showed  vCO  peaks  previously 
assigned  to  reduced  intermediates  at  1861,  1882,  1891, 1916, 
1961,  and  1979  cm'1.27  Compared  to  Hox  (Figure  2a),  the 
NaDT-treated  CrHydAi  showed  less  signal  attenuation 
after  exposure  to  02  (Figure  2d,  red  vs.  black  spectra), 
with  only  a  small  loss  of  2Fe  subcluster  signal  after  2  h  of 
exposure  to  ~o.oi%  C)2.  Regarding  vCO  peak  intensities, 
the  02  treatment  produced  new  peaks  at  1946  cm'1  and 
1969  cm'1,  a  slight  increase  in  peak  intensities  at  1916,  and 
2013  cm'1,  and  larger  increases  in  peak  intensities  at  1804 
and  1940  cm"1.  The  increases  at  1804  and  1940  cm"1  are 
assigned  to  an  increase  in  the  population  of  Hox,  and  the 
changes  at  1969  and  2013  cm"1  are  assigned  to  an  increase 
in  the  population  of  Hox-CO.  The  appearance  of  Flox  is 
similar  to  the  sequence  observed  for  02  exposure  of  H2- 
reduced  CrHydAi,  again  consistent  with  02  inactivation 
involving  at  least  one  Hox-dependent  reaction  pathway. 

FTIR  time  series  of  02  exposed  Hox  CrHydAi.  In 
order  to  observe  02-induced  transitions  in  the  2Fe 
subcluster  by  FTIR  on  the  timescales  of  full  spectral 
collection  (512  scans,  ~8  min),  CrHydAi  prepared  in  the 
Hox  state  (with  vCO  modes  at  1804,  1940,  1964  cm"1)  was 
incubated  under  a  low  (~o.oi%)  02  partial  pressure  at  4 
°C,  and  FTIR  spectra  were  collected  at  approximately  -25 
min  intervals  (Figure  3).  Under  these  conditions,  the  Hox 
signal  gradually  attenuated  and  transitioned  to  Hox-CO, 
with  primary  vCO  peaks  at  1809,  1963,  and  2013  cm"1.  A 
near  complete  transition  of  Hox  to  Hox-CO  was  observed 
after  -200  min  of  02  exposure.  The  decay  rates  of  Hox 
specific  vCO  signals  at  1940  and  1964  cm"1  were  calculated 
from  normalized  absorbance  measurements  to  be 
/<ahox=1o  4  s"1  pM'1  (Table  1).  These  rates  matched  well  to 
the  inactivation  rates  /cinact=io’4  s’1  pM"1  of  H2  evolution 
activity  (Table  1).  We  also  measured  the  change  in  the 
FTIR  spectra  of  Clostridium  acetobutylicum  HydA  (Cal) 
prepared  in  Hox  and  exposed  to  a  ~30-fold  higher 
concentration  (-0.28%)  of  02.  This  enzyme  is  ~ioo-fold 
less  sensitive  to  02  inactivation  than  CrHydAi,4  thus  was 
expected  to  have  slower  kinetics  of  02-induced  changes  in 


Table  1.  Rate  constants  for  02  induced  changes  in 
CrHydAi  and  Cal  Hox  vCO  peak  intensities,  and  H2 
evolution  activities. 


Enzyme 

k AHox 

AIR  signal, 

(s'1  pM"1) 

1 1 

'^inact 

h2 

evolution 

activity 

(s"1  pM'1) 

2Ref. 

^inact 

(s’1  liM’1) 

CrHydAi 

(A1940) 
3.0  x  10"4 

(A1964) 

2.0  X  10  4 

2.0  x  10"4 

4.3  x  10"4  (a) 
2.2  x  10  3  (b) 

Cal 

(A1945) 

-6 

9.0  x  10 

(A1800) 

-6 

1.2  X  IO 

s-  -6 

6.4  X  IO 

5.1  x  10’6  (b) 

1  k=  -[ln(y/y0)]/t  (s  1  pM"1) 

2  (a)  Goldet,  G.,  et  al.  2009.  JACS.  131:14979;  (b)  Stripp, 

S.T.,  et  al.  2009.  PNAS.  106(41) :i733i. 
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Figure  3.  Time  course  of  the  FTIR  spectra  of  Hox  CrHydAi 
exposed  to  ~o.oi%  02.  The  initial  Hox  signal  (1804,  1940, 
1964,  2071,  and  2089  cm"1)  gradually  decays,  and  the 
spectrum  transitions  to  a  characteristic  Hox-CO  signal  with 
vCO  peaks  at  1809,  1963,  1969,  and  2013  cm"1.  Signals 
assigned  to  02  damaged  clusters  are  shown  in  purple. 

FTIR  spectral  features.  A  time-course  spectrum  (Figure 
Si)  of  Cal  exposed  to  -0.28%  02  shows  a  slower  decay  rate 
of  Hox  peak  intensities,  and  subsequent  appearance  of 
Hox-CO  signals,  than  for  CrHydAi  (Table  1).  These 
differences  match  to  the  comparatively  slower  (~ioo-fold) 
inactivation  rate  of  Cal  versus  CrHydAi  (Table  1).  As 
observed  for  CrHydAi  there  is  good  agreement  between 
the  rate  of  loss  of  Hox  signal  intensity,  kAHox  ~  10"6  s"1  pM'1, 
and  the  inactivation  rate  of  H2  evolution  activity  by  02, 

/<inact  =  lO'V  pM"'. 
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Wavenumber  (cm  ') 

Figure  4.  FTIR  spectra  of  Hox  CrHydAi  after  long-term 
exposure  to  0.01%  02.  Scans  taken  at  (a)  133  min,  and  (b)  253 
min,  after  exposure  to  02.  Wavenumber  coloring  is  as 
described  in  Figure  2. 

The  percentage  of  Hox-CO  formed  after  02  exposure  of 
Hox  (change  in  primary  vCO  peaks  from  1804,  1940,  and 
1964  cm"1  to  1809,  1963  and  1969  cm"1)  was  estimated  by 
using  standardized  values  for  vCO  peak  heights, 
normalized  to  mg  of  enzyme,  for  an  anaerobically 
prepared  CrHydAi  Hox-CO  sample  (Figure  2b,  red 
spectrum).  The  amount  of  02-treated  Hox  CrHydAi  that 
converted  to  Hox-CO  was  determined  based  on 
normalizing  the  vCO  peak  intensities  in  the  02-treated 
sample  to  the  Hox-CO  standard.  The  amount  of  CrHydAi 
that  converted  to  Hox-CO  was  -20%  after  133  min  (Figure 
4a)  of  ~o.oi%  02  exposure,  where  the  rest  of  the  signal 
loss  is  likely  due  to  complete  degradation  of  the  2Fe 
subcluster,  which  is  observed  at  253  min  (Figure  4b).  The 
02-induced  Hox-CO  signal  appeared  simultaneously  with 
the  loss  of  Hox,  and  slowly  decreased  in  intensity  after  133 
min  (Figure  3).  After  a  longer  period  of  exposure,  the  loss 
of  Hox-CO  was  accompanied  by  the  appearance  and 
perturbations  of  new  vCO  peaks  at  1825,  1925  and  2025 
cm"1  (Figure  4b).  These  peaks  seem  to  be  specific  for 
oxidatively  damaged  H  cluster,  but  since  the  overall  2Fe 
subcluster  signal  is  degrading  and  other  states  are 
growing  in  (Hox-CO)  and  decaying  (Hox),  it  is  not 
currently  possible  to  determine  whether  these  peaks  arise 
from  a  single  or  multiple  H  cluster  state(s). 

UV-Vis  of  02  exposed  CrHydAi.  Ultraviolet  visible 
(UV-Vis)  spectroscopy  was  employed  to  monitor  the  ^Fe- 
4S]  cluster  during  02  exposure.  The  absorption  spectra  of 
holo-CrHydAi,  CrHydAi  expressed  in  the  absence  of 
HydE,  HydF  and  HydG  and  lacking  the  2Fe  subcluster 
(CrHydAiAEFG),  and  the  02  inactivated  CrHydAi  all 
showed  a  broad  415-420  nm  absorbance  feature  associated 
with  S— >Fe  charge  transfer  bands  of  FeS  clusters. 
CrHydAiAEFG  contains  only  a  [4Fe-4S]  subcluster  inserted 
by  E.  coli’s  FeS  cluster  assembly  machinery.37,38  The 


difference  between  the  spectra  of  CrHydAiAEEG  and  holo- 
CrHydAi  (Figure  5,  red  and  blue  trace,  respectively) 
shows  that  the  spectra  for  CrHydAiAEFG  appears  to  be  red- 
shifted  and  the  difference  in  signal  at  450  nm  accounts  for 
11%  of  the  total  absorbance.  This  absorbance  difference 
could  be  due  to  CrHydAiAEEG  being  in  a  more  oxidized 
state,  differences  in  Fe-loading  per-mol  of  enzyme,  or  the 
presence  of  the  2Fe  subcluster  causing  a  blue  shift  of  the 
450  nm  absorbance  peak  in  holo-CrHydAi. 

Inactivation  of  CrHydAi  by  exposure  to  ~o.ooi%  02  for 
2  h  was  monitored  by  H2  evolution  activity  and  UV-Vis 
spectroscopy.  Since  the  amount  of  enzyme  was  about  10- 
fold  lower  for  UV-Vis  experiments,  10-fold  less  02  was 
added  as  compared  to  FTIR  experiments.  The  initial 
activity  was  500  pmol  min  'mg"1,  which  declined  after  the 
2  h  02  treatment  to  12  pmol  min"1  mg"1.  The  blue  trace  in 
Figure  5  is  the  starting  spectrum  of  holo-CrHydAi,  and 
the  black  trace  is  after  2  h  incubation  with  02.  The 
difference  spectra  between  holo-CrHydAi  (Figure  5a,  blue 
trace)  and  02  inactivated  CrHydAi  (Figure  5a,  black  trace) 
did  not  reveal  significant  differences  (data  not  shown), 
suggesting  that  most  of  the  S— »Fe  charge  transfer 
absorption  is  maintained  after  02  inactivation  This  result 
is  consistent  with  stability  of  the  [4Fe-4S]  cluster  against 
02  damage. 

In-vitro  activation  of  O2  inactivated  CrHydAi. 

CrHydAi  that  had  been  02  inactivated  could  be  in  vitro 
activated  with  HydFEG  (Figure  S3)  and  activity  could  be 
recovered  to  ~8o%  of  as  purified  activity.  The  remaining 
population  of  enzyme  that  was  inactivateable  is  likely  due 
to  degradation  products  that  were  further  degraded  to 
possibly  [3Fe-4S]  cluster,  [2Fe-2S]  cluster,  or  completely 
stripped  of  cluster.39  This  data  suggests  there  is  a  residual 
population  of  enzyme  containing  [4Fe-4S]  after  02 
exposure.  It  had  been  previously  shown  that  HydFEG  is 
sufficient  for  activating  CrHydAiAEEG,26  and  that  activation 
of  CrHydAiAEEG  requires  the  assembly  of  a  preformed 
[4Fe-4S]  cluster.37 

Structural  characterization  of  02  inactivated 
CrHydAi.  CrHydAi  inactivated  by  02  readily  crystallized 
and  the  structure  was  determined  to  2.3A  resolution.  The 
resulting  structure  was  very  similar  to  the  previously 
characterized  structure  of  purified  CrHydAi  expressed  in 
the  absence  of  [FeFe]-hydrogenase  maturases  HydE, 
HydF,  and  HydG  with  an  overall  r.m.s.  deviation  between 
the  two  structures  of  0.29  A.37  The  structure  revealed  a 
vacant  2Fe  subcluster  site  and  an  intact  [qFe^S] 
subcluster  (Figure  5b)  and  an  open  channel  leading  from 
the  surface  to  the  site  left  vacant  in  absence  of  the  2Fe 
subcluster.  This  suggests  that  the  02  inactivated  CrHydAi 
is  the  appropriate  conformation  and  is  consistent  with  the 
observation  that  oxygen  inactivated  CrHydAi  can  be 
reactivated  by  the  H  cluster  maturation  machinery.  In 
addition,  the  structure  revealed  three  Cys  residues  (amino 
acids  88,  169,  and  238)  to  have  additional  electron  density 
around  the  sulfur  atoms  which  can  be  modeled  and 
refined  suggesting  sulfenic  acid  (R-SOH)  at  positions  Cys 
169  (which  functions  in  proton-transfer  to  the  H  cluster40) 
and  Cys  238  (surface  localized)  and  sulfinic  acid 
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Figure  5.  Crystal  structure  and  UV-Vis  of  02  inactivated  CrHydAi.  (a)  Electron  density  map  contoured  at  2a  showing  extra 
electron  density  in  the  H  cluster  environment  including  oxidized  Cys  169.  The  conserved  [FeFe]-hydrogenase  motifs  are 
depicted  below  the  ribbon  diagram  with  Cys  169  (starred),  (b)  The  full  spectrum  of  reduced  immature  CrHydAiAEFG  (— ),  and 
of  holo-CrHydAi  (— )  after  inactivation  with  0.001%  02  2  h  (— ),  or  with  3  atm  of  100%  02  for  3  h  (— ).  (c)  A  close-up  view  of 
the  350-550  nm  region. 


(R-SOOH)  at  position  Cys  88  (surface  localized).  No 
modifications  of  the  four  Cys  residues  that  function  to 
coordinate  the  H  cluster  (Cys  170,  225,  417,  and  421)  were 
detected.  The  extra  electron  density  at  Cys  169  is  best 
explained  as  a  reaction  product  of  the  sulfur  group  with 
ROS,  which  might  arise  from  02  binding  at  the  2Fe 
subcluster  as  indicated  by  the  FTIR  results.  It  is  possible 
that  the  modifications  of  the  surface  Cys  88  and  238 
might  arise  from  diffusion  of  ROS  out  of  the  catalytic  site. 
MS  analysis  confirmed  the  presence  of  sulfenic  and 
sulfinic  acid  at  the  Cys  positions  observed  in  the  X-ray 
structure  (data  not  shown).  Although  02  damage  is 
evident  from  the  X-ray  and  analytical  structures,  the  [qFe- 
4S]  subcluster  was  intact  with  no  evidence  of  O2  damage. 

■  DISCUSSION 

The  results  of  recent  studies  in  which  X-ray  Absorption 
Spectroscopy  (XAS)  was  used  to  monitor  02  damage  of 
CrHydAi  were  interpreted  to  indicate  that  after  02 
binding  and  reaction  with  the  2Fe  subcluster,  the  initial 
target  of  H  cluster  degradation  was  the  [4Fe-4S] 
subcluster.10’23  These  studies  were  conducted  with  high 
concentrations  of  02  and  in  one  study  with  the  additional 
presence  of  high  concentrations  of  NaDT.  For  example, 
Stripp  et  at.  exposed  -  30  nmol  of  CrHydAi  equilibrated  in 
the  Hox  state  to  -20%  02over  a  period  of  15  min,10  whereas 
in  Lambertz  et  al.  exposed  8  nmol  of  NaDT  reduced 
CrHydAi  to  air  saturated  buffer  (--330  pM  or  21%  02)  for 
-17  min.23  In  contrast,  for  the  comparative  study  with 
FTIR  we  monitored  Hox  2Fe  subcluster  degradation  with 
slightly  more  enzyme  (68  nmol),  but  exposed  to  --2500X 
less  (--0.137  pM  in  solution  as  determined  by  Henry’s  Law 
or  ~o.oi%  headspace  partial  pressure)  concentrated  02  in 


solution  over  a  ~17X  longer  time  period.  Similar 
experimental  setups  were  done  to  also  allow  the  use  of 
additional  analytical  techniques  (biochemical  assays,  and 
UV-Vis  spectroscopies,  mass  spectrometry,  and  X-ray 
crystallography)  to  follow  the  fate  of  the  2Fe  subcluster 
and  [4Fe-4S]  subcluster  during  the  inactivation  process. 

FTIR  analysis  of  CrHydAi  incubated  in  the  presence  of 
~o.oi%  02  in  the  absence  of  exogenously  added  reducing 
agents  exhibited  attenuation  of  the  Hox  state.  The 
attenuation  of  the  Hox  signal  tracked  nearly  1:1  with  loss  of 
enzyme  hydrogen  production  activity  and  is  strong 
evidence  that  the  2Fe  subcluster  of  the  H  cluster  is  the 
initial  site  of  02  inactivation.  In  addition  to  the  loss  of 
Hox  signal  due  to  O2,  signals  commonly  associated  with 
Hox-CO  state  appeared.  This  observation  suggests  that  02 
degrades  the  2Fe  subcluster,  liberating  CO,  and  that  the 
proportion  of  H  cluster  that  transitions  to  the  Hox-CO 
state  is  a  result  of  the  free  CO  binding  to  remaining  intact 
clusters.  CO  liberation  and  CO  binding  to  intact  enzyme 
in  Hox  is  also  thought  to  occur  during  photoillumination 
of  the  [FeFe]-hydrogenases.31  These  observations  are 
consistent  with  free  CO  being  a  potent  inhibitor  (k,~o.i 
pM)4  of  [FeFe]-hydrogenases,  and  02  liberating  CO  that 
binds  free  Hox  is  consistent  with  the  prior  EPR  studies  of 
02  treated  CpI  showing  formation  of  a  Hox-CO  axial  EPR 
signal.41 

The  changes  that  occur  when  CrHydAi  is  exposed  to  02 
appear  to  occur  to  a  greater  extent  and  at  a  higher  rate 
when  poised  in  the  Hox  state.  Enzyme  poised  in  Hox-CO, 
Hred-H2,  and  Hred-NaDT  were  less  prone  to  02-dependent 
decay.  CrHydAi  equilibrated  under  H2  or  NaDT  reduced 
also  showed  limited  degradation  compared  to  Hox,  with  a 
majority  of  2Fe  subcluster  signal  being  maintained  at  1.2% 
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to  2%  02  respectively  after  2  h  (data  not  shown).  The 
data  suggests  that  an  H  cluster  that  is  coordinately 
saturated  with  an  occupied  distal  Fe  ligand  exchangeable 
site  is  more  resistant  to  02  damage  and  provide  strong 
support  of  this  site  as  the  initial  site  of  O,  binding  and 
attack  along  the  pathway  of  H  cluster  degradation.4,27’34,35 
The  addition  of  CO  had  the  greatest  protective  effect  with 
no  observable  decay  of  Hox-CO  up  to  24.2%  02  (Figure  S2) 
over  2  h  of  exposure,  consistent  with  previous 
electrochemical  observations.4,10  The  Fl2  and  NaDT 
treated  CrHydAi  exposed  to  02  initially  transitions  into 
Hox  prior  to  degradation  indicated  by  the  observed 
appearance  of  the  1940  cm"1  feature.  Subsequently, 
similar  degradation  products  are  observed  in  the  NaDT- 
and  Fl2-treated  CrFlydAi  samples  as  were  observed  with 
Hox  samples,  with  the  transient  appearance  of  Hox-CO 
specific  FTIR  features. 

The  02  inactivated  CrHydAi  was  capable  of  being 
reactivated  by  the  addition  of  HydFEG  suggesting  that  the 
inactivated  enzyme  has  a  damaged  or  absent  2Fe 
subcluster.  Further,  as  described  above,  both  UV-Vis 
spectroscopy  and  structural  characterization  indicated 
the  presence  of  an  intact  [4Fe-4S]  subcluster  that  was 
stable  long  after  activity  attenuated.  Interestingly,  the  X- 
ray  crystal  structure  and  UV-Vis  of  02  inactivated 
CrHydAi  strongly  resembled  CrHydAi  expressed  in  the 
absence  of  maturases  (CrHydAiAEFG).  Previous  studies 
probing  H  cluster  degradation  using  comparatively  higher 
concentrations  of  02  either  with23  or  without10  NaDT 
proposed  that  the  degradation  of  the  [4Fe-4S]  subcluster 
preceded  2Fe  subcluster  degradation,  whereby  ROS  was 
generated  by  02  binding  and  reaction  at  the  2Fe 
subcluster.  Under  the  conditions  of  our  study  in  which 
CrHydAi  was  exposed  to  02  in  the  absence  of  NaDT,  the 
[4Fe-4S]  subcluster  seems  fairly  resistant  to  degradation. 
We  do  however  see  evidence  for  the  oxidation  of  the  non¬ 
coordinating  active  site  Cys  (Cys  169)  perhaps  through  the 
formation  of  ROS.42  Exposure  of  either  the  02 
degradation  intermediate  observed  here  or  CrHydAiAEEG 
to  high  concentrations  of  02  resulted  in  the  eventual 
destruction  of  the  [4Fe-4S]  subcluster,  as  evidenced  by 
reduction  and  eventual  loss  of  the  S— »Fe  charge  transfer 
bands  at  -420  nm. 

■  CONCLUSION 

The  inactivation  of  [FeFe]-hydrogenase  by  02  is  defined 
by  steps  that  involve  the  diffusion  of  gases  into  close 
proximity  of  the  catalytic  site,  followed  by  redox/chemical 
steps  of  02  reaction  with  the  H  cluster.  Our  results  are 
consistent  with  theoretical  models19  and  indicate  that  the 
latter  process  proceeds  via  oxidative  breakdown  of  the 
2Fe  subcluster,  initially  releasing  CO  that  can  bind  to 
secondary  targets  (e.g.,  other  enzymes).  Further  exposure 
results  in  the  eventual  formation  of  a  stable  break-down 
product  with  an  intact  [4Fe-4S]  subcluster  and  a  vacant 
2Fe  subcluster  site  capable  of  being  reactivated  by  2Fe 
subcluster  specific  maturation  machinery.  Long-term 
and/or  high  concentration  02  exposure  is  required  for 
oxidative  damage  of  the  [4Fe-4S]  subcluster  and  complete 


H  cluster  degradation  as  evidenced  by  the  previous 
XAS/EXAFS  studies.10  23  Based  on  our  results,  [FeFe]- 
hydrogenase  inactivation  from  02  exposure  essentially 
reverses  the  maturation  pathway  of  H  cluster  insertion 
into  immature  CrHydAi,  and  suggests  that  H  clusters 
inactivated  by  low  concentration  02  exposure  in  vitro 
could  be  substrates  for  reactivation  by  HydFEG. 
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ABSTRACT:  Proton-coupled  electron  transfer  (PCET)  is  a  fundamental  process  at  the  core  of  oxidation-reduction  reac¬ 
tions  for  energy  conversion.  The  [FeFe]-hydrogenases  catalyze  the  reversible  activation  of  molecular  H,  through  a  unique 
metallocofactor  (H-cluster)  that  is  finely  tuned  by  the  surrounding  protein  environment  to  undergo  fast  PCET  transitions. 
The  coordination  of  electronic  and  structural  transitions  at  the  H-cluster  with  proton-transfer  (PT)  steps  has  not  been 
closely  examined  experimentally.  Here,  we  explore  how  disruption  of  the  conserved  PT  network  via  a  Cys-to-Ser  mutation 
proximal  to  the  H-cluster  of  Chlamydomonas  reinhardtii  [FeFe]-hydrogenase  (CrHydAi)  affects  the  H-cluster  using  elec¬ 
tron  paramagnetic  resonance  (EPR)  and  Fourier  transform  infrared  (FTIR)  spectroscopy.  Despite  a  substantial  decrease  in 
catalytic  activity,  the  EPR  and  FTIR  spectra  reveal  different  H-cluster  catalytic  states  under  reducing  and  oxidizing  condi¬ 
tions.  Under  H2  or  sodium  dithionite  reductive  treatments,  the  EPR  spectra  show  signals  that  are  consistent  with  a  re¬ 
duced  [4Fe-4S]H1+  subcluster.  The  FTIR  spectra  showed  upshifts  of  vCO  modes  to  energies  that  are  consistent  with  an  in¬ 
crease  in  oxidation  state  of  the  diiron  subcluster,  which  was  corroborated  by  Density  Functional  Theory  analysis.  In  con¬ 
trast  to  wild-type  CrHydAi,  spectra  associated  with  Hrcd  and  Hsred  states  are  less  populated  in  the  Cys— >Ser  variant, 
demonstrating  that  the  exchange  of  -SH  to  -OH  alters  how  the  H-cluster  equilibrates  among  different  reduced  states  of 
the  catalytic  cycle  under  steady-state  conditions. 


INTRODUCTION 

Hydrogen  (HJ  gas  is  an  appealing  energy  carrier  for  the 
future,  which  has  inspired  efforts  to  develop  biological 
based  approaches  for  renewable  H2  production.1’2  The 
[NiFe]-  and  [FeFe]-hydrogenases  are  enzymes  that  cata¬ 
lyze  the  reversible  activation  of  molecular  H2  through  the 
reaction,  H2  ±5  2H+  +  2e-.  Because  they  accomplish  this 
reaction  near  the  thermodynamic  potential  of  the  H2/H+ 
couple3  at  high  rates  using  base  transition  metals,  the 
active-sites  of  these  enzymes,  and  the  mechanisms  of  H2 
activation,  provide  models  to  help  guide  synthetic  ef¬ 
forts.4"9 

The  catalytic  site  H-cluster  of  [FeFe]-hydrogenase  con¬ 
sists  of  two  subclusters,  [qFe-qSG  and  [2Fe]H,  which  are 
linked  through  a  conserved  cysteine  (Cys)  residue.10’11  The 
[2Fe]H  subcluster  is  ligated  by  strong  field  CO  and  CN" 
ligands  along  with  a  bridging  azadithiolate  (adt) 
ligand.12’13  Altogether,  the  H-cluster  is  designed  to  carry 
out  H2  activation  via  proton-coupled  electron  transfer 
(PCET)  reactions;  the  [qFe-qSG  functions  as  an  electronic 
relay14"17  between  either  accessory  FeS  clusters  (F-clusters) 


or  external  redox  carriers,18  and  adt  functioning  to  shuttle 
protons  between  PT  networks  and  [2Fe]H19’20’13  during  H2 
activation  and  proton  reduction.21"23  Mechanistic  details 
on  how  PCET  steps  are  accomplished  in  [FeFe]- 
hydrogenases  are  not  fully  described,  thus  there  is  a  need 
to  closely  examine  how  active  site  chemistry  is  functional¬ 
ly  coupled  to,  and  controls,  the  H-cluster  electronic  tran¬ 
sitions  during  catalysis. 

Several  proton-transfer  (PT)  pathways  connecting  the 
protein  surface  to  the  H-cluster  in  [FeFe]-hydrogenases 
have  been  proposed  based  on  x-ray  structure  and  bio¬ 
chemical  analysis. 10,n’24’25  The  importance  of  PT  for  effi¬ 
cient  H2  activation  has,  in  general,  been  shown  by  incor¬ 
poration  of  proton  relays  in  molecular  catalysts  that  are  a 
critical  design  component  for  improving  catalytic  perfor¬ 
mance.26  In  [FeFe]-hydrogenase  the  primary  PT  channel 
consists  of  coordinated  water  molecules  that  form  an  H- 
bond  network  with  conserved  residues  (e.g.,  Glu,  Ser).27 
Pathway  models  show  that  protons  are  transferred  along 
this  pathway  to  either  one  of  two  strictly  conserved  resi¬ 
dues,  Cys  or  Lys,  that  are  within  H-bond  distance  of  the 
H-cluster. 25,28,29  The  H-cluster  proximal  Cys  in  CrHydAi 
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(Cysi69)  is  within  H-bonding  distance  of  the  adt  ligand 
and  modeled  to  function  in  PT  to  the  bridgehead  amine27" 
29  (Figure  i).  Mutagenesis  studies  demonstrate  that  this 
Cys  residue  is  critical  for  full  catalytic  activity,  and  ex¬ 
change  of  the  -SH  group  of  Cys  to  a  more  basic  -OH 
group  of  Ser  or  nonpolar  side  chains  like  Ala  and  lie  leads 
to  severe  reduction  in  H2  evolution  and  H2  uptake  activi¬ 
ty.27,30"32  The  Lys  stabilizes  H-cluster  binding  through  H- 
bonding  with  the  terminal  CN"  ligand10,33  and  can  also  be 
modeled  to  promote  H-transfer  directly  to  the  Fe  atoms 
of  [2Fe]H-34,22  Substitutions  of  the  conserved  Lys  result  in  a 
more  severe  defect  and  were  found  to  lack  a  [2Fe]H  sub¬ 
cluster,  consistent  with  the  function  of  H-bonding  to  CN" 
31 


Fe^  ;^‘Fe 

ss?  Y  %>cn 


Figure  1.  H-cluster  schematic  and  primary  conserved 
proton  transport  pathway  in  [FeFe]-hydrogenase.  Num¬ 
bering  of  amino  acids  is  based  on  the  sequence  of  CrHydAi 
and  the  secondary  structure  is  from  PDB  ID  3LX4.35  Midpoint 
potentials  for  defined  redox  states36,37,14  are  from  DdH  (Hinact, 
Htrans.  Hox,  and  Hred)38  and  CrHydAi  (Hsred).39  Oxidation 
states  of  the  Fe  atoms  of  [2Fe]H  are  shown  adjacent  to  the 
diamond  which  represents  the  [qFe-qSjn  subcluster. 


An  initial  study  of  a  Cys-to-Ser  substitution  in  CrHydAi 
(Cysi69Ser)  and  Clostridium  acetobutylicum  (Cal, 
Cys298Ser)  result  in  a  complete  loss  of  catalytic  activity, 
which  was  associated  with  alterations  in  the  H-cluster 
spectroscopic  properties.31  Though  the  H-cluster  was  fully 
intact,  the  EPR  and  FTIR  spectra  indicated  it  was 
“trapped”  in  a  state  with  properties  similar  to  Htrans  (Fig¬ 
ure  1)  that  had  been  only  previously  observed  in  the 
[FeFe]-hydrogenase  from  Desulfovibrio  desulfuricans 
(DdH).31  This  state  is  transiently  formed  during  reductive 
activation  (e.g.,  under  H2  or  electrochemical 
reduction)40,41,36,37,42  of  the  oxidized  inactive  form  of  DdH 
(Figure  i).43-20'38,39  The  observation  that  Cysi69Ser  was  able 
to  equilibrate  into  Htrans  was  based  on  the  Cys— >Ser  sub¬ 
stitution  resulting  in  an  inactive  enzyme,  and/or  was  due 


to  a  modification  of  the  H-cluster  redox  properties  and  H2 
binding  environment.31 

Amino  acid  substitutions  that  disrupt  PT  pathways  can 
be  used  to  explore  in  more  depth  how  H-cluster  electron¬ 
ic  transitions  are  coupled  to  protonation  steps  involved  in 
H2  activation.  Here  we  examine  effects  of  a  Cysi69Ser 
substitution  in  CrHydAi  under  steady-state  conditions  by 
EPR  and  FTIR  spectroscopies,  and  evaluated  the  electron¬ 
ic  properties  of  this  variant  by  complementary  Density 
Functional  Theory  (DFT)  calculations.  Our  results  show 
that  spectral  features  assigned  to  different  oxidation 
states  of  WT  CrHydAi15  are  also  observed  in  the  Cysi69Ser 
variant,  however,  at  significantly  altered  intensities.  Un¬ 
der  reduction,  Cysi69Ser  exhibited  EPR  spectra  consistent 
with  the  presence  of  a  [4Fe-4S]H1+  subcluster  (Figure  1)39,14 
that  strongly  correlated  with  higher  frequency  vCO  peaks 
in  the  accompanying  FTIR  spectra.  DFT  analysis  of  a 
Cysi69Ser  H-cluster  model  complex  were  used  to  calcu¬ 
late  IR  vibrational  frequencies  of  CO/CN"  ligands  and  to 
help  assign  the  oxidation  state  of  [2Fe]H-  Isotope  and  ex¬ 
ogenous  CO  binding  effects  on  the  spectroscopic  signals 
were  further  explored  to  help  characterize  the  nature  of 
H-cluster  intermediates. 

EXPERIMENTAL  METHODS 

Expression  Plasmids  and  Site-directed  Mutagene¬ 
sis.  The  pETDuet  based  plasmid  constructs  for  co¬ 
expression  of  either  CrHydAi  from  C.  reinhardtii  or  Cal 
from  C.  acetobutylicum  along  with  Hyd  maturases  ( hydE , 
hydF,  hydG)  from  C.  acetobutylicum  have  been  described 
previously.44,45  Site-directed  mutagenesis  of  the  Cys  resi¬ 
due  implicated  in  PT  in  both  CrHydAi  (Cysibq)  and  Cal 
(Cys298)  were  carried  out  using  the  XL  site-directed  mu¬ 
tagenesis  kit  (Stratagene).  Plasmid  constructs  were  con¬ 
firmed  by  DNA  sequencing  before  transformation  into 
Escherichia  coli  NovaBlue  (Novagen)  for  propagation. 

Cysi69Ser  CrHydAi  and  Cys298Ser  Cal  Expression 
and  Purification.  Expression  and  purification  of 
Cysi69Ser  CrHydAi  and  Cys298S  Cal  were  carried  out 
with  slight  modifications  to  a  previously  described  proce¬ 
dure.45  Protein  expression  was  induced  by  addition  of 
IPTG  (1.5  mM  final)  and  the  following  supplements  were 
added  sequentially:  cysteine  (2  mM  final),  ammonium 
ferric  citrate  (4  mM  final),  glucose  (0.5%  v/v  final),  and 
sodium  fumarate  (10  mM  final).  Induction  was  carried  out 
in  2  L  narrow  necked  flasks  (Kimax,  Kimble)  sealed  with 
rubber  stoppers  and  proceeded  for  12-16  h  at  30  °C  under  a 
continuous  argon  sparge  with  an  outlet  needle  (18  gauge). 
Cells  were  broken  using  a  French  Press  (American  In¬ 
strument  Company)  at  1,000  psi.  Purification  over  DEAE 
Sepharose  Fast  Flow  (GE  Healthcare)  and  Strep-Tactin 
Superflow  high  capacity  (IBA)  resins  were  carried  out  in  a 
LABmaster  Glove  Box  Workstation  (Mbraun)  using  an¬ 
aerobic  buffers  supplemented  with  NaDT  (5  mM  final). 
The  H2  evolution  activity  of  the  purified  enzymes  was 
measured  by  addition  of  a  methyl  viologen  solution  (5 
mM  final)  reduced  with  NaDT  (10  mM  final).  Higher  con- 
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centrations  of  methyl  viologen  (10,  20,  40  mM  final)  and 
NaDT  (20,  40,  80  mM  final)  were  also  used  in  some  cases 
for  further  analysis.  Purified  Cysi69Ser  CrHydAi  was 
stored  at  4  °C  in  50  mM  Tris,  pH  8.0,  300  mM  NaCl,  5% 
glycerol,  5  mM  NaDT  (buffer  A)  and  Cys298Ser  Cal  was 
stored  at  4  °C  in  50  mM  Tris,  pH  8.0,  200  mM  NaCl,  5% 
glycerol,  5  mM  NaDT  (buffer  B). 

[FeFe]-hydrogenase  Sample  Preparation.  All  samples 
were  prepared  in  an  Mbraun  glove  box  under  a  nitrogen 
atmosphere.  Samples  were  concentrated  using  a  10 
MWCO  Microcon  centrifugal  filter  device  (Millipore).  As- 
isolated  samples  were  either  prepared  in  buffer  A 
(Cysi69Ser  CrHyAi)  or  Buffer  B  (Cys298Ser  Cal).  Reduced 
samples  were  prepared  in  buffer  A  with  either  additional 
NaDT  (50  mM  final)  or  100%  UHP  H2  (General  Air,  Den¬ 
ver,  CO  USA)  flush  for  10  cycles  on  a  Schlenk  line.  The  H2 
inlet  had  an  02  trap  (Big  02  Trap,  Agilent  Technologies, 
Inc.,  Santa  Clara,  CA  USA)  between  the  gas  cylinder  and 
the  Schlenk  line.  For  the  H2  sample,  high  pres¬ 
sure/vacuum  EPR  tubes  (Wilmad-LabGlass)  were  used 
and  samples  were  frozen  under  a  slight  over-pressure  of 
H2.  FTIR  of  samples  treated  with  100%  H2  were  prepared 
by  a  similar  procedure  and  a  septum-sealed  0.3  mL  coni¬ 
cal  vial  containing  a  10  pL  aliquot  of  enzyme  was  flushed 
with  H2  on  a  Schlenk  line  for  10  cycles  while  it  was  kept 
on  ice.  The  sample  was  then  equilibrated  for  2  h  at  4  °C  in 
the  Mbraun  glove  box  prior  to  obtaining  the  FTIR  spec¬ 
trum.  Auto-oxidized,  defined  previously,15  samples  were 
prepared  by  exchange  using  a  G-25  column  (GE 
Healthcare)  into  buffer  A  prepared  in  the  absence  of 
NaDT  followed  by  incubation  for  7  to  14  days  at  4  °C  in 
the  Mbraun  glove-box.  In  some  a  cases  oxidized  methyl 
viologen  was  added  (2  mM  final)  to  achieve  more  com¬ 
plete  formation  of  Hox.  As-isolated,  NaDT,  and  auto- 
oxidized  samples  were  treated  with  100%  CO  gas  by  ei¬ 
ther:  sparge  for  30  s,  5  min  incubation  on  ice,  sparge  for  15 
s,  and  a  final  10  min  incubation  on  ice;  or  by  10  vacu¬ 
um/100%  CO  flush  cycles  on  a  Schlenk  line.  Typical  sam¬ 
ple  concentrations  ranged  from  12  -  20  mg/mE  and  from 
100  -  150  mg/mL  for  higher  concentrated  samples. 

For  isotope  studies,  the  NaDT/H20  sample  was  pre¬ 
pared  in  buffer  A  with  additional  NaDT  (100  mM  final; 
final  protein  concentration:  150  mg/mL)  and  equilibrated 
for  1  h  at  4  °C  in  the  MBraun  glove  box  prior  to  obtaining 
the  IR  spectrum.  The  NaDT  reduced  sample  in  D20  was 
prepared  by  G-25  column  exchange  into  D,0  buffer  of  50 
mM  Tris,  pH  8.0,  300  mM  NaCl,  5%  glycerol  with  NaDT  at 
100  mM  (final  protein  concentration,  112  mg/mL).  The 
sample  was  equilibrated  for  30  min  at  room  temperature 
in  the  MBraun  glove  box  prior  to  obtaining  the  FTIR  spec¬ 
trum.  The  H2/H20  was  prepared  by  treating  as-isolated 
Cysi69Ser  (120  mg/mL)  with  100%  H2  by  10  flush  cycles  on 
a  Schlenk  line.  The  sample  was  in  a  septum-sealed  conical 
vial  and  kept  on  ice  during  treatment  and  was  followed  by 
incubation  for  2  h  at  4  °C  in  the  refrigerator  in  the 
MBraun  glove  box  prior  to  collecting  the  FTIR  spectrum. 
The  D2/D20  sample  (124  mg/mL)  was  prepared  by  G25- 


column  exchange  into  D,0  buffer  of  50  mM  Tris,  pH  8.0, 
300  mM  NaCl,  5%  glycerol.  After  5  days  equilibration  at  4 
°C,  the  sample  was  loaded  into  a  septum-sealed  conical 
vial  and  sparged  for  1  min  with  D2  (99.7%  D2,  Matheson 
Tri-Gas,  Newark,  CA  USA).  An  outlet  needle  prevented 
over  pressurization  and  allowed  for  gas  flow.  After  sparg¬ 
ing,  the  sample  equilibrated  overnight  (-24  h)  at  4  °C  in 
the  MBraun  glove  box  prior  to  obtaining  the  FTIR  spec¬ 
trum. 

EPR  Spectroscopy.  EPR  spectra  were  recorded  on  a 
Bruker  ELEXSYS  E500  CW  spectrometer  system  outfitted 
with  an  Oxford  Instruments  liquid  helium  cryostat 
(ESR900),  automatic  transfer  line  (LLT750),  temperature 
controller  (ITC503)  and  cylindrical  Bruker  resonator 
(SHQ).  The  magnetic  field  was  calibrated  with  a  2,2- 
diphenyl-i-picrylhydrazyl  (DPPH)  standard  from  Bruker. 
Temperatures  of  recorded  spectra  ranged  from  ±2  K.  Sim¬ 
ulations  were  carried  out  in  Easy  Spin.46  Double  integra¬ 
tion  of  the  spectra  was  carried  out  in  Origin  (OriginLab) 
for  determination  of  relative  intensities,  and  fractional 
contributions  of  the  different  signals  comprising  the 
overall  spectra.  Analysis  of  the  power  saturation  data  was 
prepared  by  plotting  log  (S/a/P)  against  log  P,  where  S  is 
signal  amplitude  and  P  is  power.  In  these  plots,  the  line 
slopes  towards  the  abscissa  with  increasing  concentra¬ 
tion.47'48  Power  saturation  data  collected  at  11  K  were  fitted 
to  the  equation  S  =  VP/ (i+P/  Py3)° 5i,  where  Py2  is  that  half- 
saturation  power  and  b  is  the  inhomogeneity  parameter. 

FTIR  Spectroscopy.  FTIR  spectra  were  recorded  on  a 
Nicolet  6700  FTIR  Spectrometer  (Thermo  Fisher  Scien¬ 
tific).  The  spectrometer  was  purged  with  N2  gas  passed 
through  a  Drierite-filled  glass  desiccant  tube  and  two 
miniature  desiccant  air  dryers  (Twin  Tower  Engineering) 
with  pre-  and  post-desiccant  filters.  The  spectrometer 
uses  a  Globar  IR  source,  with  a  CaF2  beam  splitter  and  a 
liquid-nitrogen  cooled  mercury-cadmium  telluride  (MCT) 
detector. 

The  custom  sample  cell  consisted  of  a  set  of  CaF2  win¬ 
dows  with  a  spacer  with  a  path-length  of  approximately  12 
pm.  The  sample  volume  was  approximately  5.0  pL.  A  final 
spectrum  consisted  of  512  scans  at  2  cm'1  resolution,  col¬ 
lected  by  the  OMNIC  software  program  (Thermo  Fisher 
Scientific).  Buffer  solution  served  as  the  reference  spec¬ 
trum,  where  appropriately  treated  buffer  solution  con¬ 
taining  protein  served  as  the  sample  spectrum.  All  spectra 
were  collected  at  room  temperature  (21  °C).  The  OMNIC 
software  was  configured  to  report  absorbance  spectra, 
and  the  final  format  for  the  figures  was  generated  using 
manually  adjusted  spline  baseline  fitting.  Occasionally, 
spurious  water  absorption  peaks  would  be  seen  due  to 
slight  changes  in  water  vapor  concentration  in  the  N2 
purge  gas.  These  water  absorption  peaks  could  be  re¬ 
moved  using  a  built-in  atmospheric  suppression  algo¬ 
rithm  in  OMNIC,  or  with  manual  subtraction  of  a  water 
vapor  spectrum. 
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Computational  Model  of  CrHydAi.  The  starting 
structure  for  the  DFT  calculations  was  based  on  the  x-ray 
geometry  of  the  Desulfovibrio  desulfuricans  [FeFe]- 
hydrogenase  (PDB  ID  lHFe)11  in  which  the  water  molecule 
bridging  Fed  and  Fep  (Figure  1),  was  replaced  by  a  CO 
group,  and  the  bridging  propanedithiolate  (pdt)  ligand 
was  replaced  by  adt. 12,13  Calculations  were  carried  out  on 
models  of  WT  CrHydAi  and  the  Cysi69Ser  variant,  in 
which  the  -SH  group  of  Cysi69  was  replaced  with  the  -OH 
group  of  Ser  (Figure  Si). 

A  main  difference  between  CrHydAi  and  DdH  is  the 
lack,  in  the  former,  of  an  accessory  domain  containing  the 
conserved  cysteine-rich  motifs  for  the  binding  of  the  two 
[4Fe-4S]  auxiliary  F-clusters. 11,33  However,  the  residues 
forming  the  hydrophobic  pocket  surrounding  the  active 
site,  as  well  as  the  residues  involved  in  the  putative  PT 
chain,  are  well  conserved  among  all  [FeFe]- 
hydrogenases. 27-29  In  addition,  vCO  and  vCN  stretching 
frequencies  of  well-characterized  states,  such  as  Hox  and 
Hox-CO  of  CrHydAi,  CpI  and  DdH  are  similar,  indicating 
that  the  structural  differences  in  H -cluster  environments 
between  these  enzymes  do  not  significantly  affect  vCO 
and  vCN  IR  frequencies.49  The  computational  model, 
which  is  schematically  shown  in  Figure  Si,  includes  the 
bimetallic  [2Fe]H  cluster,  and  selected  residues  of  the  se¬ 
cond  coordination  sphere.  In  the  following  discussion,  the 
residues  are  numbered  according  to  the  sequence  of 
CrHydAi. 

The  cysteine  residue  linking  [4Fe-4S]H  to  [2Fe]H  (Cys42i 
in  CrHydAi)  was  truncated  at  the  Cot  atom.  The  [4Fe-4S]H 
cluster  in  the  oxidized  (+2)  or  reduced  (+1)  forms  was 
modelled  by  a  thiol,  or  thiolate  ligand,  respectively,  as  we 
showed  that  protonation  of  the  sulphur  atom  of  Cys42i 
mimics  coordination  of  the  [4Fe-4S]H  2  cluster,  whereas  a 
thiolate  better  represents  coordination  of  the  [qFe-qSjn 
cluster  in  its  +1  redox  state.50 

The  second  coordination  sphere  of  the  metal  cluster  in¬ 
cludes  side  chains  of  residues  Cysi69/Seri69,  Lys228, 
Seri93,  GIU231,  Phe29o,  Met223,  Seri33,  and  GIU141,  and 
selected  backbone  atoms  of  Proi94,  Glni95,  Pro93  and 
Alayq.  The  truncated  residues  were  saturated  with  hydro¬ 
gens.  The  model  also  contains  a  water  molecule  H- 
bonded  to  Cysi69/Seri69  and  GIU141  as  a  part  of  the  PT 
chain. 10,11,24,25  A  detailed  list  of  the  atoms  composing  the 
model  is  shown  in  Table  Si.  Notice  that  when  states  fea¬ 
turing  a  protonated  bridgehead  amine  are  taken  into  ac¬ 
count,  the  latter  group  behaves  as  a  H-bond  donor,  the 
acceptor  being  Cysi69/Seri69;  the  chain  of  hydrogen 
bonds  then  extends  up  to  GIU141,  the  side  chain  of  which 
attains  the  anionic  COO-  state  in  this  case  ( see  Figure  Si). 
On  the  other  hand,  when  the  bridgehead  amine  is  in  the 
deprotonated  (neutral)  form  in  the  model,  the  direction¬ 
ality  of  hydrogen  bonds  in  the  chain  is  inverted,  which 
calls  for  the  introduction  of  GIU141  residue  in  its  neutral 
(COOH)  form.  During  geometry  optimizations,  selected 
atoms  were  kept  fixed  at  their  crystallographic  positions, 


in  order  to  avoid  unrealistic  distortions  at  the  boundary  of 
the  model.  These  atoms  are  also  indicated  in  Table  Si. 

Lys228,  GIU231  and  Aspi3  were  included  in  the  model 
since  they  form  a  network  of  H-bonds,  which  ends  at  one 
CN-  ligand.  Amide  hydrogens  of  Ala94  and  Glni95  also 
form  H-bonds  with  the  two  CN-  ligands.  Met223  and 
Phe290  are  in  the  hydrophobic  pocket  surrounding  the 
active  site,  and  they  can  be  crucial  in  favoring  the  con¬ 
formation  of  the  [2Fe]H  cluster  with  a  bridged  CO.  Finally, 
Cysi69  (Seri69),  GIU141,  Seri33  and  the  water  molecule  are 
involved  in  the  PT  pathway  to  shuttle  protons  from/to  the 
bridgehead  amine. 

All  residues  included  in  the  model  are  conserved  in 
CrHydAi  and  DdH  as  well  as  in  other  [FeFe]- 
hydrogenases51  emphasizing  the  possible  relevance  of  the¬ 
se  residues  in  the  catalytic  mechanism  of  the  enzyme.  An 
exception  is  Glni95,  which  is  replaced  by  lie  in  DdH. 
However,  only  backbone  atoms  of  this  residue  are  includ¬ 
ed  in  our  model. 

DFT  Calculations.  Quantum  mechanics  (QM)  calcula¬ 
tions  were  carried  out  in  the  DFT  framework  with  the 
TURBOMOLE  6.4  suite  of  programs52  by  using  the  BP86 
functional53'54  in  conjunction  with  the  resolution-of-the- 
identity  (RI)  technique55  and  an  all-electron  valence  tri¬ 
ple-^  basis  set  with  polarization  functions  TZVP  for  all 
atoms.56 

Vibrational  analyses  were  carried  out  for  all  species  in 
their  minimum  on  the  potential  energy  hypersurface  us¬ 
ing  the  same  level  of  theory.  Due  to  the  atoms  con¬ 
strained  at  the  original  positions,  several  very  small  nega¬ 
tive  eigenvalues  of  the  Hessian  were  computed.  However, 
these  imaginary  frequencies  should  not  affect  the  (much 
higher  in  energy)  vCO  and  vCN  stretching  modes.  CO 
frequencies  were  scaled  according  to  the  scaling  factor:50 

vCO(Scal.)  =  1.023  •  vCO(Cai.)  -  49 

Deuterium  isotopic  effects  on  the  CO  frequencies  were 
investigated  by  replacing  the  hydrogen  terminally  coordi¬ 
nated  to  Fed  with  deuterium  and  re-calculating  vibration¬ 
al  frequencies  from  the  hessian.  In  some  forms  of  the  en¬ 
zyme,  an  additional  frequency  was  identified  in  the  range 
of  the  CO  and  CN-  stretching  frequencies.  This  frequency 
can  be  assigned  to  a  N-H  stretching  mode  of  Lys228  asso¬ 
ciated  to  the  hydrogen  involved  in  the  H-bond  with 
GIU231.  In  fact,  this  N-H  distance  is  unusually  large,  and 
the  corresponding  NH— 0Gia  distance  unusually  small, 
explaining  the  considerable  shift  to  lower  wavenumbers 
of  this  stretching  mode,  which  is  probably  an  artefact  of 
the  model  used  for  the  computational  analysis.  Since  this 
mode  can  be  coupled  to  the  vCO  and  vCN  stretching 
modes,  thus  possibly  affecting  their  frequency  values,  we 
also  considered  to  replace  this  hydrogen  with  deuterium 
in  order  to  shift  the  N-H  stretching  frequency  well  out¬ 
side  the  region  of  the  vCO  and  vCN  stretching  modes; 
notably,  the  additional  set  of  calculations  on  such  deuter- 
ated  models  did  not  allow  us  to  observe  any  significant 
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change  in  the  computed  vCO  and  vCN  stretching  fre¬ 
quencies,  as  compared  to  the  parent  models. 

RESULTS  AND  DISCUSSION 

Biochemical  Properties  of  Purified  CrHydAi 
Cysi69Ser.  The  specific  activity  (U),  defined  as  i  pmol  H2 
produced  min"1  mg"1,  ranged  between  15  and  30  U  for 
Cysi69Ser  CrHydAi.  Although  active,  the  level  of  H2  evo¬ 
lution  activity  represents  a  significant  decrease  (30-40 
fold)  compared  to  wild-type  CrHydAi.  Use  of  increased 
concentrations  of  methyl  viologen  (MV,  80  mM)  and 
NaDT  (160  mM)  resulted  in  increased  H,  evolution  activi¬ 
ty  to  a  Vmax  of  84  U,  further  demonstrating  that  the 
Cys— »Ser  variant  retains  catalytic  function,  albeit  at  re¬ 
duced  levels.  Despite  the  reduced  activity,  both  EPR  and 
FTIR  spectra  of  Cysi69Ser  displayed  strong  spectroscopic 
signals,  indicating  that  H-cluster  incorporation  was  min¬ 
imally  affected  during  in  vivo  biosynthesis.  This  is  con¬ 
sistent  with  the  previous  initial  characterization  of  the 
Cysi69Ser  CrHydAi  variant,31  however  for  those  studies 
this  substitution  was  reported  to  be  completely  deficient 
in  H2  evolution  and  H2  oxidation  activities. 

Spectroscopic  Properties  of  As-Isolated  Cysi69Ser. 

We  showed  previously  that  when  WT  CrHydAi  was  puri¬ 
fied  in  NaDT  buffer,  a  major  fraction  equilibrated  into  the 
Hox  state,  with  minor  contributions  from  other  reduced 
states.15.  This  was  evident  from  a  strong  9=2.1  Hox  EPR 
signal  assigned  to  the  mixed-valent  Fe(II)Fe(I)  [2Fe]H  and 
diamagnetic  [4Fe-4S]H2+ subclusters,  with  minor  contribu¬ 
tions  from  other  reduced  states.15  In  contrast,  the  EPR 
spectrum  of  an  as-isolated  sample  of  Cysi69Ser  in  buffer 
containing  5  mM  NaDT  primarily  displayed  a  strong 
rhombic  2.07  signal  ( g  =  2.068,  1.943,  1.881,  Topt  =  11  K) 
along  with  an  overlapping  rhombic  2.06  signal  ( g  =  2.065, 
1.969,  1.906,  Topt  =  23  K)  (Figure  2A).  The  rhombic  2.07 
signal  decreased  in  intensity  with  increasing  temperature 
and  was  nearly  silent  at  36  K  (Figure  S2).  Above  36  K,  the 
rhombic  2.06  signal  became  more  resolved  and  could  be 
observed  up  to  50  K  without  much  broadening.  The  tem¬ 
perature  profiles  of  these  two  signals  are  most  consistent 
with  either  a  S=V2  [4Fe-4S]H1+  cluster  (rhombic  2.07)  or  a 
S=V2  [2Fe]H  cluster  (rhombic  2.06).  These  profiles  are 
nearly  identical  to  the  rhombic  2.08  ( g  =  2.077,  1-935, 
1.880)  and  2.06  ( g  =  2.061, 1.968, 1.900)  signals,  respective¬ 
ly,  we  previously  reported  for  reduced  WT  CrHydAi.15  The 
broad  shoulder  near  g  =  2.1,  more  evident  at  elevated 
temperatures  (23  K),  indicates  a  small  contribution  of  Hox 
to  the  overall  spectrum,  which  produces  a  rhombic  2.1 
signal  ( g  =  2.100,  2.039, 1.990). 57 

The  FTIR  spectrum  of  the  as-isolated  Cysi69Ser  (Figure 
2A)  displayed  numerous  peaks  in  regions  where  CO 
(-1775-2030  cm'1,  8  peaks)  and  CN"  stretching  modes 
(-2030-2100  cm"1,  4  peaks)  are  detected,  indicating  the 
presence  of  multiple  intermediates.  The  peaks  assigned  to 
vCO  modes  at  i860,  1869,  i960,  1967,  1977,  and  1984  cm"1 
and  vCN  modes  at  2074  and  2080  cm"1  most  likely  corre¬ 
late  with  the  two  rhombic  2.07  and  2.06  signals  of  the 


reduced  H-cluster.  The  peaks  observed  at  1800, 1936, 1969, 
2067,  and  2087  cm"1  are  assigned  to  the  Hox  state,15  and  are 
consistent  with  the  EPR  spectrum  having  a  g  =  2.1  Hox 
signal.  The  as-isolated  sample,  showed  IR  peaks  at  1967 
and  1969  cm"1  that  are  assigned  to  the  reduced  and  oxi¬ 
dized  H-cluster,  respectively,  based  on  comparison  of 
NaDT-treated  to  auto-oxidized  samples  (vide  infra).  At 
higher  concentrations,  a  vCO  peak  at  1896  cm"1  was  ob¬ 
served  (Figure  2E)  that  is  assigned  to  the  Hred  state  with  a 
[4Fe-4S]H2+-Fe(I)Fe(I)-[H]  H-cluster.20  A  similar  set  of 
peaks  at  1878,  1891,  and  1900  cm"1  were  also  observed  in 
the  corresponding  Cal  variant  Cys298Ser  (Figure  S3).  The 
slight  differences  in  vCO/vCN  peak  intensities  and  ener¬ 
gies  between  Cysi69Ser  and  Cys298Ser  indicate  the  H- 
cluster  in  the  two  enzymes  populate  similar  intermedi¬ 
ates,  but  at  fractional  levels.  These  differences  are  likely 
due  to  long-range  electronic  and  ET  effects  of  F-clusters 
present  in  Cal,  but  not  CrHydAi,  on  the  H-cluster  elec¬ 
tronic  state.58 

Spectroscopic  Properties  of  Oxidized  Cysi69Ser. 

The  oxidation  of  as-isolated  Cysi69Ser  was  carried  out  by 
an  auto-oxidization  procedure,  which  involves  exchange 
and  equilibration  into  buffer  without  NaDT.15  This  proce¬ 
dure  requires  enzyme  turnover  to  achieve  equilibration 
into  Hox,  therefore  to  account  for  the  much  lower  turno¬ 
ver  frequency  of  the  Cysi69Ser  variant  this  procedure  was 
carried  out  for  7  to  14  days  at  4  °C.  The  resulting  EPR 
spectra  yielded  almost  total  attenuation  of  the  rhombic 
2.07  signal  along  with  the  appearance  of  a  weak,  but  well 
resolved  rhombic  2.1  signal  (g  =  2.099,  2.040,  1.996,  Topt  = 
20  K)  of  the  Hox  state  (Figure  2B,  dark  shade).  The  rhom¬ 
bic  2.06  EPR  signal  was  also  present  at  a  similar  intensity 
to  the  as-isolated  sample  as  determined  by  double-spin 
integration  of  the  signal.  Depending  on  the  sample,  weak 
intensities  of  the  rhombic  2.07  signal  remained  present 
(Figure  2B,  light  shade).  There  was  a  minimal  contribu¬ 
tion  of  the  CO  inhibited  state  (Hox-CO)  to  the  spectra, 
which  is  indicative  of  H-cluster  degradation.  Further  oxi¬ 
dation  by  methyl  viologen  gave  an  increase  in  signal  in¬ 
tensity  of  the  rhombic  2.1  signal  along  with  appearance  of 
an  axial  g  =  2.05  signal  that  can  be  assigned  to  Hox-CO 
(Figure  S4).  As  also  pointed  out  earlier,31  the  evidence  for 
EPR  silent  species  formed  upon  auto-oxidation  is  con¬ 
sistent  with  the  weak  spectral  intensity,  which  was  20% 
(Figure  2B)  compared  to  the  as-isolated  overall  signal  at 
23  K.  In  the  earlier  report,  no  EPR  signal  was  detected  in 
the  absence  of  NaDT.  Several  defined  H-cluster  states 
such  as  Hred  and  Hinact  (Figure  1)  have  been  shown  to  be 
EPR  silent.40-42’39 

The  FTIR  spectrum  of  auto-oxidized  Cysi69Ser  showed 
vCO  peaks  at  1798  (g-CO),  1936  and  1969  cm"1  (terminal), 
and  vCN  peaks  at  2067,  and  2087  cm"1,  which  are  assigned 
to  the  Hox  state.  Enrichment  of  the  Hox  signal  in  the  IR 
spectrum  correlates  nicely  with  the  appearance  of  the 
EPR  rhombic  2.1  signal  assigned  to  Hox  (Figure  2B).  Addi¬ 
tional  IR  peaks  at  1944,  1985  (vCO),  and  2072  cm'1  (vCN) 
indicate  the  presence  of  other  intermediates. 
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Figure  2.  EPR  and  FTIR  of  Cysi69Ser  under  steady-state  reducing  and  oxidizing  conditions.  Panels  show  EPR  (left)  and 
FTIR  (right)  spectra  of  Cysi69Ser  samples.  (A)  As-isolated.  (B)  Auto-oxidized  sample  1  dark  shade,  sample  2  light-shade.  (C) 
NaDT.  (D)  100%  I  h  treated.  (E)  High  concentration  as-isolated  sample  (120  mg/mL).  (F)  High  concentration  100%  H2  treated 
sample  (120  mg/mL).  Simulations  of  EPR  signals  (left  panel)  are  shown  on  top:  red,  rhombic  2.1  (Hox);  blue,  rhombic  2.06;  green, 
rhombic  2.07.  The  vCN  and  vCO  peaks  of  FTIR  spectra  (right  panel)  assigned  to  Hox  are  in  red,  and  reduced  or  unassigned  sig¬ 
nals  are  in  black.  FTIR  spectra  were  collected  at  room  temperature.  EPR  spectrometer  settings:  temperature,  11  K;  microwave 
power,  1.0  mW,  microwave  frequency,  9.38  -  9.39  GHz;  modulation  frequency,  100  kHz;  modulation  amplitude,  10.0  G;  time  con¬ 
stant,  327.68  ms. 


The  peak  at  1985  cm'1  mostly  correlated  with  the  presence 
of  the  rhombic  2.06  signal  and  the  peaks  at  1944  and  2072 
cm'1  more  likely  correlate  to  an  EPR  silent  species,  since 
the  latter  peaks  have  only  been  observed  for  oxidized 
samples. 

Spectroscopic  Properties  of  NaDT  and  H2  Reduced 
CrHydAi  Cysi69Ser.  Reduction  of  Cysi69Ser  with  excess 
NaDT  (Figure  2C)  primarily  yielded  the  rhombic  2.07  sig¬ 
nal,  which  was  more  sharp  and  intense  than  in  the  as- 
isolated  Cysi69Ser  sample.  Treatments  with  H2  yielded 


similar  spectra  (Figure  2D),  also  primarily  composed  of 
the  sharp  rhombic  2.07  signal.  This  differed  from  the  WT 
samples  under  H2,  which  displayed  a  broadened  signal  ( g 
=  2.3-2.07)  compared  to  the  rhombic  2.07  (Table  S2).15 
Also  present  in  Cysi69Ser  under  NaDT  and  H2  reduction 
was  a  second,  overlapping  rhombic  2.06  signal,  present  at 
a  similar  intensity  as  for  the  as-isolated  and  auto-oxidized 
samples.  For  the  H2  sample,  the  feature  at  g  =  1.997  indi¬ 
cates  a  small  presence  of  Hox  that  likely  did  not  react  with 
H2.  Simulations  show  that  the  overall  spectra  of  reduced 
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samples  consist  of  86%  of  the  2.07  signal  and  14%  of  the 
2.06  signal.  The  power  saturation  dependency  of  the 
rhombic  2.07  signal  was  nearly  identical  to  the  WT  rhom¬ 
bic  2.08  signal,  and  saturated  with  increasing  power  at  11 
K  more  quickly  than  at  23  K,  indicating  a  similar  source 
for  the  two  signals  (Figure  S5). 

The  source  of  the  rhombic  2.07  signal  in  Cysi69Ser  is  as¬ 
signed  to  a  S  =  Vi,  [4Fe-4S]H1+  cluster  and  is  highly  related 
to  the  rhombic  2.08  signal  observed  in  the  WT  CrFlydAi.15 
The  assignment  of  rhombic  2.06,  however,  has  two  possi¬ 
bilities;  either  it  originates  from  a  S  =  Vi,  [2Fe-2S]1+  ( vide 
infra)  based  on  the  temperature  saturation15  or  a  S  =  Vi, 
[4Fe-4]H1+  that  is  similar  to  [4Fe-4S]H1+  of  the  Htrans  state  of 
DdH. 40-42  The  EPR  signal  assigned  to  Htrans  shows  similar 
g-values  (g  =  2.06,  1.96,  1.89)  and  temperature  properties 
([2Fe-2S]1+-like)  to  rhombic  2.06  of  Cysi69Ser.  For  the 
Htrans  signal,  Mossbauer  studies  assigned  the  H-cluster 
oxidation  state  to  [4Fe-4S]H1+  paired  with  a  diamagnetic 
Fe(II)Fe(II)  [2Fe]H  subcluster.36'37  An  Htrans  state  was  also 
reported  for  the  as-isolated  Cysi69Ser  by  Knorzer  et  al.f 
however,  this  assignment  was  based  on  an  EPR  signal 
with  (/-values  of  2.067,  i-94,  1.88.  Those  (/-values  more 
closely  align  with  the  g  =  2.07  signal  (2.068,  1.943,  1.881) 
we  observed  for  Cysi69Ser.  They  suggested  that  changes 
in  redox  properties  could  allow  a  nearby  H2C)  to  react 
with  Fed  to  produce  a  terminal  -OH  ligand  as  modeled  by 
earlier  DFT  studies  of  Htrans  and  also  Hinact, 59,60  although 
this  ligand  has  not  yet  been  observed  experimentally.38 
Thus,  the  two  overlapping  2.07  and  2.06  EPR  signals  of 
Cysi69Ser  (and  2.08/2.06  for  wild-type  CrHydAi)15  repre¬ 
sent  discrete  states,  and  are  distinguished  by  slight  differ¬ 
ences  in  (/-tensor,  and  temperature  and  power  properties. 
We  propose  these  states  arise  from  changes  at  [2Fe]H  that 
lead  to  each  state  having  a  unique  terminal  ligand,  elec¬ 
tronic  spin  and  charge  distribution,  coordination  envi¬ 
ronment  and/or  protonation  state  of  adt. 

The  FTIR  spectra  of  Cysi69Ser  reduced  with  NaDT  (Fig¬ 
ure  2C)  or  H2  gas  (Figure  2D)  showed  complexity  in  the 
terminal  vCO  region  (1900-2030  cm'1),  but  at  higher  ener¬ 
gy  positions  compared  to  WT  CrHydAi.  In  addition,  there 
were  several  vCO  peaks  in  a  region  consistent  with  a 
“semi-bridging”  (--1850-1880  cm'1)  assignment.  Overall, 
under  reduction  with  NaDT,  13  peaks  were  observed  at 
i860,  1870,  1920,  1931,  1949,  i960,  1967,  1978,  1984,  2074, 
2079,  2°87,  and  2092  cm'1.  Likewise,  reduction  with  H2 
also  produced  a  complex  IR  spectrum  with  12  peaks  at 
1859,  1867,  1937,  i960,  1967,  1975,  1984,  2000,  2073,  2079> 
2086,  2088  cm'1.  The  slight  shift  of  the  1969  cm'1  peak  as¬ 
signed  to  Hox  WT,  to  1967  cm'1  in  Cysi69Ser,  may  reflect 
minor  rearrangements  of  [2Fe]H  during  reduction. 
Through  analysis  of  these  spectra  and  other  data  not 
shown,  it  appears  that  vCO  peaks  at  i860,  1977,  and  1984 
cm'1  and  peaks  at  1867,  i960,  and  1967  cm'1  can  be  assigned 
to  two  discrete  states.  It  is  possible  that  these  two  sets  of 
signals  correlate  to  the  rhombic  2.06  and  2.07  EPR  signals, 
however  the  potential  presence  of  EPR  silent  species 
along  with  the  different  temperatures  for  data  collection 


between  EPR  (11-60  K)  and  FTIR  (room  temperature), 
complicates  these  correlations.  For  higher  concentrated 
Cysi69Ser  samples  treated  with  H2  (Figure  2F),  residual 
Hox  was  observed  by  the  presence  of  peaks  at  1800,  1936, 
and  1969  cm'1,  whereas  additional  peaks  at  1874,  1944, 
i960,  1968,  1973,  1987,  2001,  2011,  2067,  2080,  2087,  2099 
2112  cm'1  are  similar  to  the  lower  concentrated  sample. 
This  may  indicate  a  decreased  ability  for  the  Cysi69Ser 
variant  to  react  with  H2  or  incomplete  H2  treatment,  since 
residual  Hox  peaks  are  also  observed  for  H2  treated  wild- 
type  CrHydAi.15 

Observation  of  Hydrogen/Deuterium  Isotope- 
Sensitive  FTIR  Peaks  in  Reduced  Cysi69Ser.  A  mecha¬ 
nism  for  H2  activation  by  [FeFe]-hydrogenases  has  been 
proposed  based  on  crystallographic  and  FTIR  analysis  of 
reduced  DdH  and  CrHydAi,  whereby  the  p-CO  is  pro¬ 
posed  to  alternate  between  bridging  (Hox)  and  terminal 
(Hred-hydride)  orientations  as  the  H-cluster  transitions 
through  a  catalytic  cycle.11’20'38,39'14  This  shift  of  the  p-CO  is 
assumed  to  occur  along  with  formation  of  terminal  hy¬ 
dride  on  [2Fe]H. 59,61  It  has  been  shown  for  metal-carbonyl 
organometallic  complexes  that  terminal  hydrides  exert  a 
strong  trans-influence  on  the  trans-ligand,  in  this  case  the 
p-CO  of  [2Fe]H-  This  can  result  in  mixing  of  the  vFeH  with 
vCO  modes  and  causes  an  hydrogen/deuterium  isotope 
effect  on  the  position  of  vCO  in  the  IR  spectrum.62  Nota¬ 
bly,  the  H/D  isotope  effect  induces  a  shift  of  vCO  to  lower 
energies  after  H— »D  substitution,  with  only  small  to  neg¬ 
ligible  effect  on  cis  CO  ligands.63-65  Thus,  reduction  of 
[FeFe]-hydrogenases  in  H-  or  D-enriched  buffers  can  be 
used  to  assign  vCO  peaks  to  the  p-CO  ligand  based  on  a 
isotope  induced  shift,  and  provide  indirect  evidence  for 
an  H-species  bound  to  the  terminal  position  on  [2Fe]H- 

As  shown  in  Figure  3  (detailed  spectra,  Figure  S6),  re¬ 
duction  of  Cysi69Ser  by  NaDT  or  H2  produces  vCO  peaks 
in  the  1860-1870  cm-1  region,  which  are  consistent  with  a 
“semi-bridging”  assignment.  After  exchange  from 
H20— >D20  enriched  buffers  followed  by  reduction  with 
NaDT  or  D2,  these  peaks  shifted  to  slightly  lower  energies. 
Specifically,  the  i860  and  1868  cm-1  peaks  shifted  to  1853 
and  1864  cm-1  after  reduction  with  NaDT,  and  the  peak 
observed  at  1859  cm-1  under  100%  H2-treatment  in  H20, 
shifted  to  1853  cm-1  under  100%  D2  in  D,0  buffer  (Figure 
3).  These  shifts  are  consistent  with  a  change  in  mass  of  a 
terminal  ligand  on  Fed  that  is  positioned  trans  to  the 
semi-bridging  CO,  and  supports  a  terminal  H-species 
(e.g.,  hydride)  coordinated  to  the  Fed  atom  of  [2Fe]H  in 
these  reduced  states.  The  DFT  calculations  shown  in  Fig¬ 
ure  3  on  the  reduced  H-cluster  in  Cysi69Ser  simulated 
with  either  H  or  D  terminal  ligands  also  show  a  similar 
H— >D  induced  downshift  in  the  p-CO  peak. 
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Figure  3.  Comparison  of  experimental  FTIR  bands  of 
CrHydAi  CysibgSer  after  reduction  to  calculated  FTIR 
bands  from  a  DFT  model  of  Cysi69Ser.  (A)  Experimental 
FTIR  bands  in  vCO  region  from  CrHydAi  Cysi69Ser  enzyme 
reduced  with  either  NaDT,  H2,  or  D,  in  buffer  prepared  with 
either  I  120  or  D20.  (B)  DFT  calculated  FTIR  bands  in  the 
vCO  region  based  on  the  Cysi69Ser  H-cluster  model  pre¬ 
pared  in  H20  or  D20  solvent.  Bands  indicated  in  red  (H20) 
are  observed  to  shift  to  slightly  lower  energies  under  D20 
(blue).  Bands  in  light  gray  are  either  assigned  to  the  Hox  state 
or  are  unassigned. 

DFT  calculations  of  the  vCO  frequencies  in  reduced 
Cysi69Ser  and  WT  enzymes.  The  observation  of  the 
H— >D  exchange  effect  on  the  p-CO  bands  called  for  a 
detailed  DFT  analysis  of  adducts  featuring  a  terminal  hy¬ 
dride  trans  to  the  p-CO  on  Fed.  In  addition,  a  comparison 
of  the  FTIR  spectrum  of  WT  CrHydAi  to  the  Cysi69Ser 
variant  after  treatment  with  100%  H2  shows  the  principle 
vCO  peaks  shift  to  higher  energies  (Figure  S7),  suggesting 
that  binding  H2  to  Hox  induces  a  further  oxidation  of  the 
[2Fe]H  subcluster  from  a  mixed-valent  Fe(II)Fe(I)  to  a 
diferrous  state.  Notably,  DFT  vibrational  analysis  per¬ 
formed  on  Cysi69Ser  species,  in  which  the  [2Fe]H  subclus¬ 
ter  attains  the  Fe(II)Fe(II)  state,  and  the  [qFe-qSJn  cluster 
is  in  the  reduced  [4Fe-4S]H1+  state,  shows  that  the  princi¬ 
ple  vCO  peaks  occur  at  1878,  1971,  and  1992  cm'1  or  1862, 
1951,  and  1965  cm'1,  depending  on  the  protonation  state  of 
the  adt  ligand  (3b  and  4b  in  Figure  4  and  Table  1).  In  par¬ 
ticular,  the  assignments  of  vCO  frequencies  reported  in 
Table  1  for  3b  and  4b  are  associated  to  mean  absolute  er¬ 
rors  (MAE)  as  low  as  9  and  5  cm"1,  respectively,  compared 
to  the  experimental  frequencies  that  correlate  with  the 
rhombic  2.06  and  2.07  EPR  signals  ( vide  infra).  This  sug¬ 
gests  that  these  two  states  might  actually  differ  in  the 
protonation  state  of  the  adt  ligand.  As  far  as  the  effects  of 
H— »D  substitution  on  theoretical  vibrational  spectra  are 


concerned,  the  presence  of  a  D"  terminal  ligand  in  the 
model  leads  to  a  significant  downshift  in  the  p-CO  ener¬ 
gies,  in  agreement  with  experiments  (see  Figure  3). 

A  very  good  match  between  computed  and  experi¬ 
mental  values  can  also  be  found  when  WT  CrHydAi  is 
taken  into  account:  in  particular,  the  comparison  between 
theoretical  frequencies  of  model  4a  and  the  available  ex¬ 
perimental  data  highlights  a  MAE  as  low  as  9  cm'1.  The 
latter  result  suggests  that  the  higher  oxidation  state  in¬ 
termediate  also  forms  in  the  WT  form  of  the  enzyme.  This 
conclusion  is  further  supported  by  the  EPR  spectra  for 
which  similar  rhombic  signals  are  observed  in  both 
Cysi69Ser  and  WT  under  reduction  ( vide  infra).  Multiple 
theoretical  studies  have  predicted  formation  of  a  [2Fe]H 
intermediate  with  Fe(II)Fe(II)  oxidation  states  upon  H2 
binding  and  activation  at  the  H-cluster. 19,59,50  Experi¬ 
mental  evidence  from  x-ray  absorption  studies  on 
CrHydAi  also  indicated  a  slight  oxidation  of  [2Fe]H  upon 
H2  binding  by  Hox,66  consistent  with  the  transition  of 
[2Fe]H  from  Fe(II)Fe(I)  to  Fe(II)Fe(II). 

Table  1.  Calculated  and  experimental  CO  vibrational 
frequencies  for  the  H-cluster  model  intermediates 
described  in  this  study  (Figure  4). 


CO  Vibrational  Frequencies  (cm1) 

Model 

DFT 

Experimental 

ia 

1841, 1949, 1989 

1802, 1940, 1964° 

lb 

1831, 1948, 1995 

1798, 1936, 1969 

2a 

I905,  1991,  2011 

not  observed 

2b 

1902,  1992,  2012 

not  observed 

3a 

1879, 1972, 1992 

1861, 1961,  i979a,b 

3b 

1878, 1971, 1992 

i860, 1977, 1984° 

4a 

1862, 1951, 1964 

1861, 1961,  i979a,b 

4b 

1862, 1951, 1965 

1867,  i960, 1967° 

5a 

1803, 1891, 1904 

1792,  1891,  I9i5a 

aWT  signals  are  reported  from  a  previous  analysis  of 
CrHydAi.15  bExperimental  assignment  of  these  CO  peaks 
cannot  be  distinguished  between  models  3a  and  4a.  ‘  Experi¬ 
mental  assignment  of  these  CO  peaks  cannot  be  distin¬ 
guished  between  models  3b  and  4b. 

Disruption  of  PT  in  Cysi69Ser  Causes  a  Shift  in  the 
Steady-state  Equilibria  of  H-cluster  Redox  States.  The 

strong  bias  for  CrHydAi  Cysi69Ser  to  equilibrate  into  a 
[4Fe-4S]1+-Fe(II)Fe(II)  oxidation  state  is  again  consistent 
with  the  Cys-to-Ser  substitution  disrupting  fast  PT  during 
the  catalytic  cycle  of  the  H-cluster.  The  vCO  peaks  ob¬ 
served  in  the  reduced  WT  CrHydAi  that  occur  at  1882, 
1891,  1919,  1920  and  1954  cm"1  are  significantly  weaker  in 
the  FTIR  of  reduced  Cysi69Ser.  Our  initial  assignments  of 
WT  CrHydAi  EPR  signals  observed  under  NaDT  reduc¬ 
tion  (rhombic  2.08)  and  H2  reduction  (broad  2.3-2.07) 
were  Hsred  or  [4Fe-4S]H1+-Fe(I)Fe(I)  and  [qFe^sy4- 
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Fe(II)Fe(II)  state,  respectively.15  A  more  comprehensive 
analysis  that  considers  the  results  presented  here  for 
Cysi69Ser  suggests  the  rhombic  2.07/2.08  signals  better 
correlate  to  a  [4Fe-4S]H1+-Fe(II)Fe(II)  assignment.  The 
contribution  of  the  broad  signal  is  strongest  in  the  WT 
EPR  spectrum  when  the  1883,  1891,  1916,  and  1933  cm"1 
vCO  peaks  dominate  the  FTIR  spectrum/5  suggesting  the¬ 
se  might  be  assigned  to  a  [4Fe-4S]H1+-Fe(I)Fe(I)  FI -cluster. 
Likewise,  the  rhombic  2.08  signal  in  WT  CrFlydAi  corre¬ 
lated  best  with  the  occurrence  of  1861,  1961  and  1979  cm"1 
vCO  peaks/5  which  matches  with  the  rhombic  2.07  signal 
and  higher  energy  vCO  peaks  of  Cysi69Ser  (Figure  2).  By 
comparison  the  FIsred  state  of  WT  CrFlydAi  described  by 
Adamaska  et  al.14  as  [4Fe-4S]H1+-Fe(I)Fe(I)  was  formed 
under  reduction  by  NaDT  and  100%  FF  with  vCO  peaks  at 
1954,  1919  and  1882  cm"1,  and  a  rhombic  EPR  signal  with 
properties  most  similar  to  the  rhombic  2.07/2.08  signals 
we  observed  in  Cysi69Ser  and  WT,  respectively  (Figure  2, 
Table  S2).  Although  the  catalytic  relevance  of  Hsred  has 
recently  been  questioned,67  these  studies  indicate  that 
formation  of  Hred  and  FIsred  require  functional  PT,  and  that 
reduction  of  the  [2Fe]H  subcluster  to  the  Fe(I)Fe(I)  level  is 
accomplished  via  a  PCET  mechanism. 

Disruption  of  PT  pathway  and  the  subsequent  preven¬ 
tion  of  electronic  transitions  at  the  H-cluster  may  also 
explain  accumulation  of  a  [4Fe-4S]H1+  subcluster  signal  in 
the  EPR  spectrum  of  reduced  Cysi69Ser.  Intramolecular 
electron  transfer  between  the  [qFe-qSJn  and  [2Fe]H  sub¬ 
clusters  is  facilitated  by  strong  spin  coupling  exchange 
through  the  bridging  Cys  thiolate.36,37,61’68"70  This  coupling 
may  be  disrupted  when  PT  and  ET  steps  are  decoupled  in 
Cysi69Ser,  further  demonstrating  that  the  electronic 
properties  of  the  H-cluster  are  tightly  coupled  to  its  sur¬ 
rounding  protein  framework. 69,71"73’31,74’75 

In  addition,  the  substitution  of  a  more  electronegative 
group  like  -OH  for  -SH  just  3.5  A  away  from  the  bridge¬ 
head  amine  could  alter  H-cluster  interaction  and  cause 
changes  to  the  electronic  distribution.  Model  studies  on 
[2Fe]H  compounds  show  that  electron-delocalization  ex¬ 
tends  into  the  adt  ligand  suggesting  that  nearby  changes 
such  as  H-bonding  or  differences  in  protonation  state  of 
the  bridgehead  amine  could  result  in  different  H-cluster 
properties.76  The  latter  observation  is  consistent  with  the 
calculations  here,  which  give  different  vCO  frequencies 
for  H-cluster  states  where  the  only  difference  is  the  pro¬ 
tonation  state  of  the  bridgehead  (Figure  4).  Interestingly, 
the  upward  vCO  shifts  observed  in  this  study  compare 
nicely  to  FTIR  of  a  NaDT  reduced  CrHydAi  that  was  re¬ 
constituted  with  a  synthetic  [2Fe]H  mimic  containing  an 
O  atom  in  the  form  of  pdt  in  place  of  N  at  the  bridgehead 
position.13 

Spectroscopic  Properties  of  CO  Treated  CysihqSer. 

CO  is  known  to  inhibit  [FeFe]-hydrogenase  catalytic  ac¬ 
tivity  by  binding  reversibly  to  the  terminal  site  on  Fed  of 
the  oxidized  [2Fe]H  subcluster,77  which  results  in  distinc¬ 
tive  Hox-CO  signals  in  both  EPR"8'57'41  and  FTIR 
spectra.43,79  Thus,  the  requirement  for  an  open  site  on  Fed 


for  exogenous  CO  binding  together  with  the  unique  spec¬ 
tral  signatures  provide  a  means  to  probe  the  coordination 
sphere  and  occupancy  of  Fed. 
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Figure  4.  Schematic  of  model  for  H,  activation  by  WT  and 
Cysi69Ser  variant  of  CrHydAi  based  on  experimental  results 
and  DFT  calculations.  The  respective  thiol  and  hydroxyl  side 
chains  of  nearby  Cys  and  Ser  residues  are  noted  in  green  and 
purple. 

The  interaction  of  CO  with  Cysi69Ser  was  explored  by 
treating  differently  prepared  Cysi69Ser  samples  with 
100%  CO  gas.  As  reported  by  Knorzer  et  al.,31  the  treat¬ 
ment  of  as-isolated,  or  NaDT  reduced  Cysi69Ser  with 
100%  CO  did  not  give  rise  to  the  characteristic  Hox-CO 
EPR  axial  signal  of  CrHydAi  (data  not  shown).39’15 

By  comparison,  treatment  of  auto-oxidized  Cysi69Ser 
with  CO  yielded  an  axial  2.05  signal  ( g  =  2.047,  2  005, 
2.007)  assigned  to  formation  of  Hox-CO  (Table  S2).  This 
signal  coincided  with  the  complete  replacement  (or  at- 
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tenuation)  of  the  rhombic  2.1  Hox  signal  (Figure  5).  How¬ 
ever,  the  intensity  of  the  rhombic  2.06  signal  present  in 
the  auto-oxidized  EPR  spectrum  remained  unchanged 
after  CO  exposure  (Figure  5)  indicating  that  this  H-cluster 
species  does  not  readily  react  with  CO.  The  observations 
that  the  Hox  state  in  Cysi69Ser  occurs  only  after  pro¬ 
longed  auto-oxidation,  that  CO  requires  an  open  coordi¬ 
nation  site  on  Fed  to  induce  formation  of  Hox-CO,8°  and 
that  the  state  that  gives  rise  to  the  rhombic  2.06  signal 
does  not  readily  react  with  CO,  are  consistent  with 
Cysi69Ser  equilibrating  into  a  reduced  state  that  harbors 
a  terminally  bound  H-species. 


g-value 

2.20  2.10  2.00  1.90  1.80 


Gauss 


Figure  5.  EPR  (top  panel)  and  FTIR  (bottom  panel)  of 
auto-oxidized  Cysi69Ser  treated  with  100%  CO.  The  EPR 
spectrum  collected  at  23  K  (black)  can  be  simulated  (gray) 
with  a  combination  of  45%  rhombic  2.06  (blue)  and  55% 
axial  2.05  (red)  components.  Sample  concentration,  12 
mg/mL.  Other  spectrometer  settings  as  Figure  2  caption. 

Likewise,  the  FTIR  spectrum  of  Cysi69Ser  sample  also 
showed  the  strongest  Hox-CO  signal  after  auto-oxidization 
and  CO  treatment.  The  Hox-CO  spectrum  of  Cysi69Ser 


was  similar  to  WT  showing  an  upshift  of  p-CO  from 
1802— >1816  cm"1,  with  an  additional  vCO  peak  at  2005  cm"1 
from  binding  and  vibrational  coupling  of  a  terminal  exog¬ 
enous  CO  (Figure  5).  For  WT  CrHydAi,  binding  of  an  ex¬ 
ogenous  CO  to  the  terminal  site  of  [2Fe]H  gives  rise  to  a 
peak  at  2013  cm"'.  The  7  cm"1  difference  of  this  vCO  peak  in 
Cysi69Ser  versus  WT  may  arise  from  slight  differences  in 
coordination  of  other  CO  and  CN"  ligands,  or  H-bonding 
and  charge  distribution  around  the  H-cluster. 

Accumulation  of  a  Reduced  Intermediate  with  a 
Hydrogen  Species  Bound  to  [2Fe]H  of  CysihqSer.  The 
observation  that  Cysi69Ser  forms  a  reduced  state  that 
does  not  easily  react  easily  with  CO  suggests  that  the  CO 
binding  site,  located  at  the  open  coordination  site  at  Fed 
of  [2Fe]H,77  is  occupied  by  another  terminal  ligand.  While 
the  possibility  for  altered  redox  properties  of  the  H- 
cluster  in  Cysi69Ser3'  might  affect  the  reactivity  with  CO, 
it  is  also  possible  that  an  H-species  is  stabilized  at  the 
distal  site  of  this  state  due  to  disruption  of  PT.  Since  H2 
formation  and  oxidation  is  a  two  proton  process,  an  in¬ 
termediate  such  as  [4Fe-4S]H1+-Fe(II)Fe(II)-H  might  ac¬ 
cumulate,  which  is  consistent  with  EPR/FTIR  spectra  and 
the  observed  isotope  sensitive  vCO  modes.  Similar  spec¬ 
tral  features  are  observed  in  reduced  WT  CrHydAi,  but  at 
lower  intensities  compared  to  Cysi69Ser.  We  hypothesize 
the  differences  are  due  to  a  shift  in  the  equilibrium  of 
reduced  states  from  the  disruption  of  a  PT-dependent 
step  (summarized  in  Figure  4),  and/or  a  shifts  in  the  mid¬ 
point  potentials  of  reduced  states  as  presented  in  Figure 
2  37,38 

Buildup  of  H2-derived  species  at  Fed  fits  the  model  of  ca¬ 
talysis  for  which  protons  are  shuttled  between  the  en¬ 
zyme  surface  and  catalytic  site  with  the  terminal  step  of 
the  primary  PT  pathway  being  proton  exchange  between 
Cysi69  and  adt.  Substitution  of  Cys-to-Ser  is  likely  to  slow 
this  step,  and  indirectly  limit  the  exchange  of  protons 
between  Fed  of  [2Fe]H  and  adt  and  possibly  by  disrupting 
the  H-bonding  network  of  the  pathway.28  While  the  Cys 
residue  belongs  to  the  primary  PT  pathway,  several  other 
PT  networks  have  been  identified27  that  perhaps  function 
as  a  secondary  pathway  in  Cysi69Ser.  This  may  explain 
the  observations  in  this  study  that  although  Cysi69Ser  has 
much  lower  catalytic  activity,  it  still  exhibits  spectral 
properties  of  H-cluster  electronic  and  structural  transi¬ 
tions. 

CONCLUSIONS 

Using  EPR  and  FTIR  spectroscopy  in  conjunction  with 
DFT  calculations  we  have  provided  evidence  that  disrup¬ 
tion  of  the  primary  PT  pathway  by  the  Cysi69— >Ser  sub¬ 
stitution  in  CrHydAi  substantially  reduces  catalytic  activi¬ 
ty,  and  alters  steady-state  equilibration  of  the  H-cluster 
among  catalytic  states.  We  hypothesize  this  effect  arises 
from  decoupling  of  PT  and  ET  steps,  and/or  changes  in 
mid-point  potentials  with  the  exchange  of  Cys— 3-Ser.  We 
demonstrated  that  under  reduction,  Cysi69Ser  was  biased 
towards  H-cluster  states  consisting  of  a  S=V2,  [4Fe-4S]H1+ 
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subcluster,  signifying  the  importance  of  intramolecular 
ET  to  catalysis  by  [FeFe]-hydrogenase.  Likewise,  the  FTIR 
spectra  and  DFT  calculations  showed  that  the  [2Fe]H  sub¬ 
cluster  of  Cysi69Ser  was  biased  more  strongly  towards  a 
Fe(II)Fe(II)  redox  state.  The  fact  that  one  of  these  difer- 
rous  states  (rhombic  2.06)  was  unreactive  with  CO  and 
showed  an  H— >D  sensitive,  semi-bridging  p-CO  is  con¬ 
sistent  with  the  presence  of  an  exchangeable,  terminal  H- 
species  trans  to  p-CO.  The  properties  of  these  [4Fe-4S]H1+- 
Fe(II)Fe(II)  H-cluster  states  agree  well  with  activation 
model  of  H2  by  the  H-cluster.  Indeed,  DFT  calculations  on 
the  model  used  in  this  work  show  that  activation  of  H2  in 
a  [4Fe-4S]2+-Fe(II)Fe(II)  H-cluster  is  slightly  exothermic 
by  -2  kcal/mol,  whereas  activation  on  the  corresponding 
[4Fe-4S]2+-Fe(II)Fe(I)  is  endothermic  by  +5  kcal/mol.  A 
diferrous  activation  state  was  also  proposed  earlier  by  Fan 
and  Hall,19  and  supported  by  recent  results  on  a  H-cluster 
model  where  heterolytic  cleavage  of  H2  by  the  diiron  sub¬ 
site  was  stimulated  by  initial  oxidation  to  the  diferrous 
state  by  a  covalently  attached  ferrocene.7 

ASSOCIATED  CONTENT 

Supporting  Information.  Table,  Schematic  and  Figures 
describing  the  computational  model  of  the  H-cluster.  Table 
and  Figures  of  EPR  signals,  temperature  profiles,  and  power 
saturation  data.  FTIR  Figures  of  isotope  spectra,  Cys298Ser 
Cal  spectrum,  and  difference  spectra  of  WT  and  Cysi69Ser 
CrHydAi  treated  with  H,. 

AUTHOR  INFORMATION 

Corresponding  Author 

*  paul.king@nrel.gov 

ACKNOWLEDGMENT 

(D.W.M,  M.W.R.,  and  P.W.K.  gratefully  acknowledge  fund¬ 
ing  support  from  the  U.S.  Department  of  Energy,  Division  of 
Chemical  Sciences,  Geosciences,  and  Biosciences,  Office  of 
Basic  Energy  Sciences;  and  support  of  the  U.S.  Department  of 
Energy  under  Contract  No.  DE-AC36-08-GO28308  with  the 
National  Renewable  Energy  Laboratory  for  the  hydrogenase 
purification,  biochemical  and  spectroscopy  studies.  M.B.  and 
C.G.  acknowledge  the  University  of  Milano-Bicocca  (FAR 
2013)  for  support  of  studies  on  H-cluster  models  and  DFT 
calculations.  E.K.  and  J.W.P.  thank  the  Air  Force  Office  of 
Scientific  Research  (grant  FA-9550-11-1-0218)  for  funding 
support. 

REFERENCES 

(1)  Lee,  H.  S.;  Vermaas,  W.  F.;  Rittmann,  B.  E.  Trends  Biotechnol. 
2010,  28,  262. 

(2)  Sakurai,  H.;  Masukawa,  H.;  Kitashima,  M.;  Inoue,  K.  J. 
Photochem.  Photobiol.,  C  2013, 17, 1. 

(3)  Vincent,  K.  A.;  Parkin,  A.;  Armstrong,  F.  A.  Chem.  Rev.  2007, 107, 
4366. 

(4)  Tard,  C.;  Liu,  X.;  Ibrahim,  S.  K.;  Bruschi,  M.;  De  Gioia,  L.;  Davies, 
S.  C.;  Yang,  X.;  Wang,  L.-S.;  Sawers,  G.;  Pickett,  C.  J.  Nature  2005, 
433,  610. 

(5)  DuBois,  M.  R.;  DuBois,  D.  L.  Chem.  Soc.  Rev.  2009, 38,  62. 

(6)  Gloaguen,  F.;  Rauchfuss,  T.  B.  Chem.  Soc.  Rev.  2009, 38, 100. 

(7)  Camara,  J.  M.;  Rauchfuss,  T.  B.  Nat.  Chem.  2012,  4,  26. 


(8)  Chenevier,  P.;  Mugherli,  L.;  Darbe,  S.;  Darchy,  L.;  DiManno,  S.; 
Tran,  P.  D.;  Valentino,  F.;  Iannello,  M.;  Volbeda,  A.;  Cavazza,  C.  C.  R. 
Chim.  2013, 16,  491. 

(9)  Hsieh,  C.  H.;  Ding,  S.;  Erdem,  O.  F.;  Crouthers,  D.  J.;  Liu,  T.; 
McCrory,  C.  C.;  Lubitz,  W.;  Popescu,  C.  V.;  Reibenspies,  J.  H.;  Hall, 

M.  B.;  Darensbourg,  M.  Y.  Nat.  Commun.  2014, 5,  3684. 

(10)  Peters,  J.  W.;  Lanzilotta,  W.  N.;  Lemon,  B.  J.;  Seefeldt,  L.  C. 
Science  1998,  282, 1853. 

(11)  Nicolet,  Y.;  Piras,  C.;  Legrand,  P.;  Hatchikian,  C.  E.;  Fontecilla- 
Camps,  J.  C.  Struct.  Fold.  Des.  1999,  7, 13. 

(12)  Silakov,  A.;  Wenk,  B.;  Reijerse,  E.;  Lubitz,  W.  Phys.  Chem.  Chem. 
Phys.  2009, 11,  6592. 

(13)  Berggren,  G.;  Adamska,  A.;  Lambertz,  C.;  Simmons,  T.; 
Esselborn,  J.;  Atta,  M.;  Gambarelli,  S.;  Mouesca,  J.-M.;  Reijerse,  E.; 
Lubitz,  W.  Nature  2013,  499,  66. 

(14)  Adamska,  A.;  Silakov,  A.;  Lambertz,  C.;  Rudiger,  O.;  Happe,  T.; 
Reijerse,  E.;  Lubitz,  W .Angew.  Chem.  Int.  Ed.  2012,51, 11458. 

(15)  Mulder,  D.  W.;  Ratzloff,  M.  W.;  Shepard,  E.  M.;  Byer,  A.  S.; 
Noone,  S.  M.;  Peters,  J.  W.;  Broderick,  J.  B.;  King,  P.  W.  /.  Am.  Chem. 
Soc.  2013, 135,  6921. 

(16)  Adamska-Venkatesh,  A.;  Krawietz,  D.;  Siebel,  J.;  Weber,  K.; 
Happe,  T.;  Reijerse,  E.;  Lubitz,  W  .J  Am  Chem  Soc  2014, 136, 11339. 

(17)  Lambertz,  C.;  Chernev,  P.;  Klingan,  K.;  Leidel,  N.;  Sigfridsson,  K. 

G. ;  Happe,  T.;  Haumann,  M.  Chem.  Sci.  2014. 

(18)  Florin,  L.;  Tsokoglou,  A.;  Happe,  T.  /.  Biol.  Chem.  2001,  276, 
6125. 

(19)  Fan,  H.  J.;  Hall,  M.  B ./.  Am.  Chem.  Soc.  2001, 123,  3828. 

(20)  Nicolet,  Y.;  de  Lacey,  A.  L.;  Vernede,  X.;  Fernandez,  V.  M.; 
Hatchikian,  E.  C.;  Fontecilla-Camps,  J.  C ./.  Am.  Chem.  Soc.  2001, 123, 
1596. 

(21)  Siegbahn,  P.  E.;  Tye,  J.  W.;  Hall,  M.  B.  Chem.  Rev.  2007,  toy, 
4414. 

(22)  Finkelmann,  A.  R.;  Stiebritz,  M.  T.;  Reiher,  M.  Chem.  Sci.  2014, 
5.  215- 

(23)  Fourmond,  V.;  Greco,  C.;  Sybirna,  K.;  Baffert,  C.;  Wang,  P.-H.; 
Ezanno,  P.;  Montefiori,  M.;  Bruschi,  M.;  Meynial-Salles,  I.;  Soucaille, 
P.  Nat.  Chem.  2014,  6,  336. 

(24)  Fontecilla-Camps,  J.  C.;  Volbeda,  A.;  Cavazza,  C.;  Nicolet,  Y. 
Chem.  Rev.  2007,  toy,  4273. 

(25)  Hong,  G.;  Cornish,  A.;  Hegg,  E.;  Pachter,  R.  Biochim.  Biophys. 
Acta,  Bioenerg.  2011, 1807,  510. 

(26)  Dutta,  A.;  Lense,  S.;  Hou,  J.;  Engelhard,  M.  H.;  Roberts,  J.  A.; 
Shaw,  W.  ].].  Am.  Chem.  Soc.  2013, 135, 18490. 

(27)  Cornish,  A.  J.;  Gartner,  K.;  Yang,  H.;  Peters,  J.  W.;  Hegg,  E.  L. /. 
Biol.  Chem.  2011,  286,  38341. 

(28)  Ginovska-Pangovska,  B.;  Ho,  M.-H.;  Linehan,  J.  C.;  Cheng,  Y.; 
Dupuis,  M.;  Raugei,  S.;  Shaw,  W.  J.  Biochim.  Biophys.  Acta,  Bioenerg. 
2014, 1837, 131. 

(29)  Long,  H.;  King,  P.  W.;  Chang,  C.  H ./.  Phys.  Chem.  B  2014. 

(30)  Lautier,  T.;  Ezanno,  P.;  Baffert,  C.;  Fourmond,  V.;  Cournac,  L.; 
Fontecilla-Camps,  J.  C.;  Soucaille,  P.;  Bertrand,  P.;  Meynial-Salles,  I.; 
Leger,  C.  Faraday  Discuss.  2011, 148,  385. 

(31)  Knorzer,  P.;  Silakov,  A.;  Foster,  C.  E.;  Armstrong,  F.  A.;  Lubitz, 
W.;  Happe,  T.J.  Biol.  Chem.  2012,  287, 1489. 

(32)  Morra,  S.;  Giraudo,  A.;  Di  Nardo,  G.;  King,  P.  W.;  Gilardi,  G.; 
Valetti,  F.  PloS  one  2012,  7,  648400. 

(33)  Mulder,  David  W.;  Shepard,  Eric  M.;  Meuser,  Jonathan  E.;  Joshi, 

N. ;  King,  Paul  W.;  Posewitz,  Matthew  C.;  Broderick,  Joan  B.;  Peters, 
John  W.  Structure  2011, 19, 1038. 

(34)  Zampella,  G.;  Greco,  C.;  Fantucci,  P.;  De  Gioia,  L.  Inorg.  Chem. 
2006,  45,  4109. 

(35)  Mulder,  D.  W.;  Boyd,  E.  S.;  Sarma,  R.;  Lange,  R.  IC;  Endrizzi,  J. 
A.;  Broderick,  J.  B.;  Peters,  J.  W.  Nature  2010,  465,  248. 

(36)  Popescu,  C.  V.;  Miinck,  E ./.  Am.  Chem.  Soc.  1999, 121,  7877. 

(37)  Pereira,  A.  S.;  Tavares,  P.;  Moura,  I.;  Moura,  J.  J.  G.;  Huynh,  B. 

H . /.  Am.  Chem.  Soc.  2001, 123,  2771. 

(38)  Roseboom,  W.;  De  Lacey,  A.  L.;  Fernandez,  V.  M.;  Hatchikian, 
E.  C.;  Albracht,  S.  P.  J Biol.  Inorg.  Chem.  2006,  u,  102. 

11 


ACS  Paragon  Plus  Environment 


Journal  of  the  American  Chemical  Society 


Page  1 2  of  1 3 


1 

2 

3 

4 

5 

6 

7 

8 

9 

10 
11 
12 

13 

14 

15 

16 

17 

18 

19 

20 
21 
22 

23 

24 

25 

26 

27 

28 

29 

30 

31 

32 

33 

34 

35 

36 

37 

38 

39 

40 

41 

42 

43 

44 

45 

46 

47 

48 

49 

50 

51 

52 

53 

54 

55 

56 

57 

58 

59 

60 


(39)  Silakov,  A.;  Kamp,  C.;  Reijerse,  E.;  Happe,  T.;  Lubitz,  W. 
Biochemistry  2009,  48,  7780. 

(40)  Patil,  D.  S.;  Moura,  J.  J.  G.;  He,  S.  H.;  Teixeira,  M.;  Prickril,  B.  C.; 
Dervartanian,  D.  V.;  Peck,  H.  D.;  Legall,  ].;  Huynh,  B.  H.  J.  Biol. 
Chem.  1988,  2 63, 18732. 

(41)  Pierik,  A.  J.;  Hagen,  W.  R.;  Redeker,  J.  S.;  Wolbert,  R.  B.; 
Boersma,  M.;  Verhagen,  M.  F.;  Grande,  H.  J.;  Veeger,  C.;  Mutsaers,  P. 
H.;  Sands,  R.  H.;  Dunham,  W.  R.  Eur.  J.  Biochem.  1992,  209,  63. 

(42)  Albracht,  S.  P.  J.;  Roseboom,  W.;  Hatchikian,  E.  C.J.  Biol.  Inorg. 
Chem.  2006, 11,  88. 

(43)  Pierik,  A.  J.;  Hulstein,  M.;  Hagen,  W.  R.;  Albracht,  S.  P.  Eur.  J. 
Biochem.  1998,  258,  572. 

(44)  King,  P.  W.;  Posewitz,  M.  C.;  Ghirardi,  M.  L.;  Seibert,  M.  J. 
Bacterial.  2006, 188,  2163. 

(45)  Yacoby,  I.;  Tegler,  L.  T.;  Pochekailov,  S.;  Zhang,  S.;  King,  P.  W. 
PloS  one  2012,  7,  e35886. 

(46)  Stoll,  S.;  Schweiger,  A./.  Magn.  Reson.  2006, 178,  42. 

(47)  Beinert,  H.;  Orme-Johnson,  W.  In  Magnetic  Resonance  in 
Biological  Systems;  Ehrenberg,  A.,  Malmstrom,  B.,  Vanngard,  T., 
Eds.;  Pergamon  Press  Ltd.:  1967,  p  221. 

(48)  Rupp,  H.;  Rao,  K.  It;  Hall,  D.  O.;  Cammack,  R.  Biochim. 
Biophys.  Acta  1978, 537,  255. 

(49)  Myers,  W.  K.;  Stich,  T.  A.;  Suess,  D.  L.  M.;  Kuchenreuther,  ].  M.; 
Swartz,  J.  R.;  Britt,  R.  D ./.  Am.  Chem.  Soc.  2014. 

(50)  Yu,  L.;  Greco,  C.;  Bruschi,  M.;  Ryde,  U.;  De  Gioia,  L.;  Reiher,  M. 
Inorg.  Chem.  2011,  50,  3888. 

(51)  Happe,  T.;  Kaminski,  A.  Eur.  J.  Biochem.  2002,  269, 1022. 

(52)  Ahlrichs,  R.;  Bar,  M.;  Haser,  M.;  Horn,  H.;  Kolmel,  C.  Chem. 
Phys.  Lett.  1989, 162, 165. 

(53)  Perdew,  J.  P.  Phys.  Rev.  B  1986,  33,  8822. 

(54)  Becke,  A.  D.  Phys.  Rev.  A  1988,  38,  3098. 

(55)  Eichkorn,  K.;  Treutler,  O.;  Ohm,  H.;  Haser,  M.;  Ahlrichs,  R. 
Chem.  Phys.  Lett.  1995,  240,  283. 

(56)  Schafer,  A.;  Huber,  C.;  Ahlrichs,  R.  J.  Chem.  Phys.  1994,  100, 
5829. 

(57)  Adams,  M ./.  Biol.  Chem.  1987,  262, 15054. 

(58)  Greco,  C.;  Bruschi,  M.;  Fantucci,  P.;  Ryde,  U.;  De  Gioia,  L ./.  Am. 
Chem.  Soc.  2011, 133, 18742. 

(59)  Liu,  Z.-P.;  Hu,  P ./.  Am.  Chem.  Soc.  2002, 124,  5175. 

(60)  Tye,  J.  W.;  Darensbourg,  M.  Y.;  Hall,  M.  B.  Inorg.  Chem.  2008, 
47,  2380. 

(61)  Fiedler,  A.  T.;  Brunold,  T.  C.  Inorg.  Chem.  2005,  44,  9322. 

(62)  Kaesz,  H.  D.;  Saillant,  R.  B.  Chemical  Reviews  1972,  72,  231. 

(63)  Vaska,  L ./.  Am.  Chem.  Soc.  1966,  88,  4100. 

(64)  Kubas,  G.  J.;  Burns,  C.  J.;  Khalsa,  G.  R.  K.;  Van  Der  Sluys,  L.  S.; 
Kiss,  G.;  Hoff,  C.  D.  Organometallics  1992, 11,  3390. 

(65)  van  der  Vlugt,  J.  I.;  Rauchfuss,  T.  B.;  Whaley,  C.  M.;  Wilson,  S. 
R.  J.  Am.  Chem.  Soc.  2005, 127, 16012. 

(66)  Stripp,  S.;  Sanganas,  O.;  Happe,  T.;  Haumann,  M.  Biochemistry 
2009,  48,  5042. 

(67)  Hajj,  V.;  Baffert,  C.;  Sybirna,  It;  Meynial-Salles,  I.;  Soucaille,  P.; 
Bottin,  H.;  Fourmond,  V.;  Leger,  C.  Energy  Environ.  Sci.  2014,  7,  715. 

(68)  Schwab,  D.  E.;  Tard,  C.;  Brecht,  E.;  Peters,  J.  W.;  Pickett,  C.  J.; 
Szilagyi,  R.  K.  Chem.  Commun.  (Cambridge,  U.  K.)  2006,  3696. 

(69)  Greco,  C.;  Bruschi,  M.;  Heimdal,  J.;  Fantucci,  P.;  De  Gioia,  L.; 
Ryde,  U.  Inorg.  Chem.  2007,  46,  7256. 

(70)  Silakov,  A.;  Reijerse,  E.  ].;  Albracht,  S.  P.  J.;  Hatchikian,  E.  C.; 
Lubitz,  W ./.  Am.  Chem.  Soc.  2007, 129, 11447. 

(71)  Bruschi,  M.;  Greco,  C.;  Kaukonen,  M.;  Fantucci,  P.;  Ryde,  U.;  De 
Gioia,  L.  Angew.  Chem.  Int.  Ed.  2009,  48,  3503. 

(72)  Trohalaki,  S.;  Pachter,  R.  Int.  J.  Hydrogen  Energy  2010, 35,  5318. 

(73)  Giles,  L.  ].;  Grigoropoulos,  A.;  Szilagyi,  R.  K.  Eur.  J.  Inorg.  Chem. 
2011,  2  011,  2677. 

(74)  Miyake,  T.;  Bruschi,  M.;  Cosentino,  U.;  Baffert,  C.;  Fourmond, 
V.;  Leger,  C.;  Moro,  G.;  De  Gioia,  L.;  Greco,  C.  J.  Biol.  Inorg.  Chem. 
2013, 18,  693. 

(75)  Winkler,  M.;  Esselborn,  J.;  Happe,  T.  Biochim.  Biophys.  Acta, 
Bioenerg.  2013, 1827,  974. 


(76)  Erdem,  O.  F.;  Stein,  M.;  Kaur-Ghumaan,  S.;  Reijerse,  E.  J.;  Ott, 
S.;  Lubitz,  W.  Chem.  Eur.  J.  2013, 19, 14566. 

(77)  Lemon,  B.  J.;  Peters,  J.  W.  Biochemistry  1999, 38, 12969. 

(78)  Erbes,  D.  L.;  Burris,  R.;  Orme-Johnson,  W.  Proc.  Natl.  Acad.  Sci. 
U.  S.  A.  1975,  72,  4795. 

(79)  Chen,  Z.;  Lemon,  B.  J.;  Huang,  S.;  Swartz,  D.  J.;  Peters,  J.  W.; 
Bagley,  K.  A.  Biochemistry  2002,  41,  2036. 

(80)  Bennett,  B.;  Lemon,  B.  J.;  Peters,  J.  W.  Biochemistry  2000,  39, 
7455- 


12 


ACS  Paragon  Plus  Environment 


1 

2 

3 

4 

5 

6 

7 

8 

9 

10 

11 

12 

13 

14 

15 

16 

17 

18 

19 

20 

21 

22 

23 

24 

25 

26 

27 

28 

29 

30 

31 

32 

33 

34 

35 

36 

37 

38 

39 

40 

41 

42 

43 

44 

45 

46 

47 

48 

49 

50 

51 

52 

53 

54 

55 

56 

57 

58 

59 

60 


Journal  of  the  American  Chemical  Society 


ACS  Paragon  Plus  Environment 


13 


[FeFe]-  and  [NiFe]-Hydrogenase  Diversity,  Mechanism,  and  Maturation 


John  W.  Peters1’5*,  Gerrit  J.  Schut2'5,  Eric  S.  BoycE’5,  David  W.  Mulder4’5,  Eric  S.  Shepard1,  Joan 
B.  Broderick1,  Paul  W.  King4’5,  and  Michael  W.  W.  Adams2'5 

department  of  Chemistry  and  Biochemistry,  Montana  State  University,  Bozeman,  MT  59717; 
department  of  Biochemistry  and  Molecular  Biology,  University  of  Georgia,  Athens,  GA  30602; 
department  of  Microbiology  and  Immunology,  Montana  State  University,  Bozeman,  MT 

59717; 

4National  Renewable  Energy  Laboratory,  Golden  CO  80401; 

"The  Biological  Electron  Transfer  and  Catalysis  Energy  Frontiers  Research  Center 

*  Corresponding  Author  -  John  W.  Peters,  iohn.peters@chemistry.montana.edu,  VOICE  406- 
994-7212,  FAX  406-994-5407. 


1 


Abstract 


The  [FeFe]-  and  [NiFe]-hydrogenases  catalyze  the  formal  interconversion  between  hydrogen  and 
protons  and  electrons,  possess  characteristic  non-protein  ligands  at  their  catalytic  sites  and  thus 
share  common  mechanistic  features.  Despite  the  similarities  between  these  two  types  of 
hydrogenases,  they  clearly  have  distinct  evolutionary  origins  and  likely  emerged  from  different 
selective  pressures.  [FeFe]-hydrogenases  are  widely  distributed  in  fermentative  anaerobic 
microorganisms  and  likely  evolved  under  selective  pressure  to  couple  hydrogen  production  to  the 
recycling  of  electron  carriers  that  accumulate  during  anaerobic  metabolism.  In  contrast,  many 
[NiFe]-hydrogenases  catalyze  hydrogen  oxidation  as  part  of  energy  metabolism  and  were  likely 
key  enzymes  in  early  life  and  arguably  represent  the  predecessors  of  modern  respiratory 
metabolism.  Although  the  reversible  combination  of  protons  and  electrons  to  generate  hydrogen 
gas  is  the  simplest  of  chemical  reactions,  the  [FeFe]-  and  [NiFe]-hydrogenases  have  distinct 
mechanisms  and  differ  in  the  fundamental  chemistry  associated  with  proton  transfer  and  control 
of  electron  flow  that  also  help  to  define  catalytic  bias.  A  unifying  feature  of  these  enzymes  is 
that  hydrogen  activation  itself  has  been  restricted  to  one  solution  involving  diatomic  ligands 
(carbon  monoxide  and  cyanide)  bound  to  an  Fe  ion.  On  the  other  hand,  and  quite  remarkably, 
the  biosynthetic  mechanisms  to  produce  these  ligands  are  exclusive  to  each  type  of  enzyme. 
Furthermore,  these  mechanisms  represent  two  independent  solutions  to  the  formation  of  complex 
bioinorganic  active  sites  for  catalyzing  the  simplest  of  chemical  reactions,  reversible  hydrogen 
oxidation.  As  such,  the  [FeFe]-  and  [NiFe]-hydrogenases  are  arguably  the  most  profound  case  of 
convergence  evolution. 
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Introduction 


The  enzyme  hydrogenase  utilizes  hydrogen  (H2)  as  a  substrate  or  produces  H2  by  the 
reduction  of  protons.  The  two  main  types  of  hydrogenases  are  classified  by  the  nature  of  the 
metal  clusters  at  their  catalytic  sites  and  are  termed  [FeFe]-  and  [NiFe]-hydrogenases.  Their 
active  sites  have  structural  characteristics  in  common  including  the  presence  of  carbon  monoxide 
(CO)  and  cyanide  (CN')  groups  bound  to  the  iron  (Fe)  ions.  The  [FeFe] -hydrogenase  active  site 
cluster  is  termed  the  H  cluster,  and  exists  as  a  regular  [4Fe-4S]  subcluster  bridged  to  a  2Fe 
subcluster  unit  through  a  bridging  cysteine  thiolate.  The  2Fe  subcluster  is  also  coordinated  by 
the  non-protein  ligands  carbon  monoxide,  cyanide  and  dithiomethylamine,  thereby  giving  it  a 
distinctly  organometallic  character.  The  Fe  atom  at  the  active  site  of  the  [NiFe]-hydrogenase  has 
a  similar  architecture  as  one  of  the  Fe  ions  of  the  2Fe  subcluster  with  CO  and  CN"  ligation  but  it 
is  bridged  to  a  Ni  atom  through  bridging  cysteine  thiolates.  The  Ni  atom  is  in  turn  terminally 
coordinated  by  two  additional  cysteine  thiolates.  In  some  hydrogenases  one  of  the  two  terminal 
cysteines  is  replaced  by  selenocysteine  and  these  so-called  [NiFeSej-hydrogenases  are 
considered  variations  of  [NiFe]-hydrogenases.  Both  [FeFe]-  and  [NiFe]-hydrogenases  typically 
contain  multiple  iron-sulfur  clusters  that  exist  as  either  cysteine  coordinated  [4Fe-4S]  or  [2Fe- 
2S]  clusters  and  in  some  cases  there  is  mixed  coordination  with  a  His  substituting  for  one  of  the 
Cys  residues.  As  conduits  between  the  catalytic  metal  sites  and  external  electron  donors  and 
acceptors,  the  composition  of  these  iron-sulfur  clusters  varies  among  different  phylogenetic 
clades  that  in  large  part  delineate  hydrogenases  on  the  basis  of  physiological  function. 

In  the  case  of  both  [FeFe]-  and  [NiFe]-hydrogenases,  the  CO  and  CN"  ligands  at  their 
catalytic  sites  function  as  strong  71-acceptor  ligands  that  facilitate  low-spin,  II/I  oxidation  states 
of  the  Fe  atoms.  In  all  cases,  the  hydrogenases  require  efficient  proton-transfer  (PT),  and  proton- 
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coupled  electron  transfer  (PCET)  to  achieve  fast  H2  activation  rates  of  turnovers  of  ~  1  O  '- 1 04  s’1 
[1-4].  This  functional  necessity  is  satisfied  by  a  secondary  coordination  sphere  that  includes 
conserved  polar  residues  with  exchangeable  sites  for  transferring  protons  to  the  redox-active 
bimetallic  clusters  [5-8],  In  addition,  these  conserved  residues  can  also  participate  in  H-bonding 
interactions  to  stabilize  cofactor  orientations  and  tune  electronic  properties.  Altogether  these 
extended  interactions  are  critical  for  creating  a  catalytic  reaction  coordinate  that  minimizes  high- 
energy  steps,  or  kinetic  barriers  that  would  otherwise  constrain  enzymatic  function  and 
efficiency. 

For  the  most  part,  hydrogenases  function  in  recycling  reduced  electron  carriers  that 
accumulate  during  anaerobic  fennentation  through  proton  reduction  or  in  coupling  FF  oxidation 
to  energy  yielding  processes.  The  [FeFe] -hydrogenases  are  typically  associated  with  proton 
reduction  and  [NiFe]-hydrogenases  with  FF  oxidation,  although  there  are  a  number  of  exceptions 
[9,  10].  Despite  sharing  the  unique  features  of  CO  and  CN-  ligands  to  active  site  Fe  ions,  the 
[FeFe]-  and  [NiFe]-hydrogenases  are  not  evolutionarily  related  and  show  no  sequence  similarity 
[11,  12].  In  general,  [FeFe]-hydrogenases  are  found  in  anaerobic  bacteria  and  are  especially 
prevalent  among  the  fennentative  organisms  (e.g.,  firmicutes).  They  are  also  found  in  a  number 
of  lower  eukaryotes  including  algae  and  protists  but  surprisingly  they  have  yet  to  be  found  in 
cyanobacteria  or  in  the  archaeal  domain.  In  contrast,  [NiFe]-hydrogenases,  are  frequently  found 
associated  with  cyanobacteria  and  archaea,  in  addition  to  their  common  occurrence  in  a  large 
number  of  bacteria.  A  variety  of  organisms  harbor  multiple  hydrogenases  [11-14].  In  some 
cases,  this  may  provide  functional  redundancy  but  it  is  thought  in  general  they  likely  have 
different  roles.  Some  bacteria,  again  largely  firmicutes  and  sulfate  reducing  bacteria,  even 
harbor  both  [NiFe]-  and  [FeFe] -hydrogenases  [11,  12]. 
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The  [FeFe]-hydrogenases  are  closely  related  to  a  protein  found  only  in  eukaryotes  termed 
Narl  or  Narf  [15,  16].  This  protein  has  been  implicated  in  having  a  role  in  cytoplasmic  iron- 
sulfur  cluster  biosynthesis  and/or  repair,  and  appears  to  resemble  a  minimal  [FeFe]-hydrogenase 
that  lacks  a  2Fe  subcluster.  Homology  models  of  Narf  suggest  the  presence  of  an  open  cavity 
adjacent  to  the  [4Fe-4S]  cubane  that  could  accommodate  the  2Fe  subcluster  [17].  We  have 
proposed  [18]  that  Narf  proteins  and  [FeFe]-hydrogenases  have  a  common  ancestor  that  more 
likely  resembles  the  simpler  Narf-like  protein  and  that  the  biosynthetic  pathway  for  assembly 
and  insertion  of  the  2Fe  subcluster  evolved,  at  least  in  part,  in  response  to  the  characteristics  of 
the  vacant  cavity  present  in  the  ancestral  enzyme  and  fill  it  with  a  organometallic  cluster  capable 
of  catalyzing  new  biochemistry.  Whatever  the  merits  of  this  proposal,  in  general  [FeFe]- 
hydrogenases  do  not  occur  universally  in  deeply  rooted  lineages  of  ribosomal  RNA-based 
phylogenetic  reconstructions  and  they  are  not  commonly  associated  with  metabolisms  considered 
by  many  to  be  ancestral  (e.g.,  chemolithotrophy  or  methanogenesis).  This  indicates  that  it 
unlikely  that  [FeFe]-hydrogenases  were  a  property  of  the  Last  Universal  Common  Ancestor 
(LUCA)  and  are  perhaps  not  old  from  an  evolutionary  perspective  [18], 

As  discussed  below  and  in  contrast  to  the  [FeFe]-enzymes,  the  [NiFe]-hydrogenases  are 
encoded  in  the  genomes  of  many  Bacteria  and  Archaea  and  are  frequently  detected  in  deeply 
rooted  lineages  of  both  domains.  For  this  reason,  one  can  make  an  argument  that  [NiFe]- 
hydrogenases  played  a  key  role  in  the  metabolism  of  the  LUCA  [19],  The  closest  evolutionarily 
relative  of  [NiFe]-hydrogenases  are  proteins  involved  in  respiratory  Complex  I  /  NADH 
dehydrogenase  [20,  21].  Subunits  comprising  NADH  dehydrogenase  are  close  relatives  of  the 
so-called  group  4  membrane  bound  [NiFe]-hydrogenases  that  couple  the  oxidation  of  ferredoxin 
(Fd)  to  proton  reduction  which  is  in  turn  coupled  to  generating  a  membrane  potential  through  ion 
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pumping.  This  mechanism  is  analogous  to  the  mechanism  of  Complex  I  in  coupling  NADH 
oxidation  and  ubiquinone  reduction  to  proton  pumping.  This  suggests  that  ion  pumping 
membrane  [NiFe]-hydrogenases  are  the  ancestors  of  modern  respiratory  Complex  I  and 
chemiosmosis. 

Given  that  both  [FeFe]-  and  [NiFe]-hydrogenases  have  independently  evolved  essentially  the 
same  unique  solution  to  catalyzing  reversible  FF  oxidation  chemistry  (carbon  monoxide  and 
cyanide  ligands  to  Fe  ions)  is  somewhat  of  a  surprise.  However  evidence  for  independent 
evolutionary  origins  is  also  supported  in  the  stark  difference  in  the  mechanisms  by  which  these 
unique  carbon  monoxide  and  cyanide  ligands  are  synthesized  in  their  respective  cluster 
maturation  pathways  [22-27].  The  process  by  which  the  CO,  CN",  and  other  nonprotein  ligands 
are  synthesized,  coordinated  to  iron,  and  are  inserted  into  the  active  sites  of  [FeFe]-  and  [NiFe]- 
hydrogenases  to  yield  the  mature  enzymes  clearly  represents  two  different  paradigms  in  metal 
cluster  biosynthetic  pathways.  These  two  independent  paths  for  the  evolution  of  these  unrelated 
enzymes  perhaps  represents  one  of  the  most  profound  cases  of  convergence  evolution  and  one 
that  is  supported  by  differences  in  the  taxonomic  distribution  of  these  enzymes,  the  pathways 
required  to  synthesize  active  site  co-factors,  and  their  physiological  roles. 


Diversity 

Our  recent  screening  of  2919  complete  bacterial  and  archaeal  genomes  available  in  July 
2014  (Boyd,  E.S.,  unpublished  data)  indicate  the  presence  of  [FeFe]-hydrogenase  homologs  in 
265  of  those  genomes  (9.1%  of  total)  while  778  (26.7%  of  total)  of  those  genomes  encode 
homologs  of  [NiFe]-hydrogenases.  Like  previous  studies  [11,  12,  14],  we  found  that  [FeFe]-  and 
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[NiFe]-hydrogenase  homologs  are  discretely  distributed  at  the  domain,  phylum,  and  order  levels 
of  taxonomic  classifications.  While  [FeFe]-hydrogenases  are  encoded  in  the  genomes  of 
anaerobic  bacteria  and  anaerobic  or  phototrophic  eukaryotes,  they  are  not  encoded  by  archaea. 
In  contrast,  [NiFe]-hydrogenases  are  encoded  by  both  aerobic  and  anaerobic  bacteria  and  archaea 
but  eukarya  (Fig.  1). 

Genes  encoding  homologs  of  [FeFe]-hydrogenase  were  identified  in  the  genomes  of 
anaerobic  members  of  several  phyla  (Fig.  1),  including  the  Actinobacteria  (7  /  301  genomes), 
Firmicutes  (160  /  622  genomes),  Bacteroides  (9  /  94  genomes),  Chloroflexi  (11/24  genomes), 
Proteobacteria  (28  /1240  genomes),  Spirochaetes  (20  /  65  genomes),  Dictyoglomi  (2/2 
genomes),  Ignavibacteriae  (2/2  genomes),  and  Thermotogae  (18  /1 8  genomes).  In  addition, 
[FeFej-hydrogenase  homologs  were  identified,  albeit  infrequently,  among  members  of 
Synergistetes,  Verrucomicrobia,  Nitrospirae,  and  Fusobacteria.  Intriguingly,  homologs  were  not 
identified  among  the  Chlamydia  (109  genomes),  Chlorobi  (11  genomes),  Cyanobacteria  (79 
genomes),  Deinococcus-Thermus  (21  genomes),  Tenericutes  (89  genomes),  or  Planctomycetes  (8 
genomes).  Among  eukaryotes,  [FeFej-hydrogenase  were  identified  in  the  genomes  of  several 
anaerobic  protists  ( Trichomonas ,  Nyctotherus),  Fungi  ( Neocallimastix ,  Sclerotinia),  and  green 
algae  (Vo  I  vox,  Chlamydomonas). 

Genes  encoding  homologs  of  [NiFej-hydrogenase  were  more  evenly  distributed  among 
bacterial  and  archaeal  phyla  when  compared  to  [FeFej-hydrogenase  (Fig.  1).  Among  archaea, 
the  genomes  of  all  members  of  the  crenarchaeal,  euryarchaeal,  and  korarchaeal  taxonomic  orders 
contain  at  least  one  [NiFej-hydrogenase  homolog  while  genomes  from  members  of  the 
Nanoarchaea  and  Thaumarchaea  do  not.  Among  the  Crenarchaea,  homologs  were  identified  in 
13  of  the  35  genomes,  whereas  70  of  the  110  euryarchaeal  genomes  encoded  [NiFej-hydrogenase 
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homologs.  Among  bacterial  phyla,  [NiFe]-hydrogenase  homologs  were  common  in  the 
Actinobacteria  (72  /  303  genomes),  Aquificae  (13  /  14  genomes),  Chlorobi  (10  /  11  genomes), 
Chloroflexi  (30  /  35  genomes),  Cyanobacteria  (43  /  79  genomes),  Firmicutes  (86  /  622 
genomes),  and  Proteobacteria  (497  /  1240  genomes).  In  addition,  homo  logs  were  identified 
among  members  of  the  Acidobacteria,  Chrysiogenetes,  Deferribacteres,  Deinococcus-Thermus, 
Dictyoglomi,  Elusimicrobia,  Nitrospirae,  Planctomycetes,  Synergistetes,  Thermodesulfobacteria, 
Thermotogae,  and  Verrucomicrobia.  [NiFe]-hydrogenase  homologs  were  not  identified  in 
members  of  the  Chlamydia  (109  genomes),  Fusobacteria  (8  genomes),  Spirochaetes  (65 
genomes),  or  Tenericutes  (89  genomes).  As  previously  noted  [12],  the  presence  of  both  [FeFe]- 
and  [NiFe]-hydrogenase  in  the  same  genome  is  rare  and  is  found  among  members  of  the 
bacterial  phyla  Actinobacteria,  Bacteroidetes,  Chloroflexi,  Dictyoglomi,  Firmicutes,  Nitrospirae, 
Proteobacteria,  Thermodesulfobacteria,  and  Thermotogae,  suggesting  unique  functional  roles 
for  these  enzymes  in  the  metabolism  of  each  organism. 

Phylogenetic  and  Functional  Diversity  of  [FeFe]-hydrogenases 

[FeFe]-hydrogenase  are  delineated  from  paralogous  proteins  present  in  eukaryotes,  such 
as  Narl,  by  the  presence  of  three  conserved  cysteine-containing  motifs  that  coordinate  the  active 
site  H-cluster.  These  consist  of  TSCCPxW  (LI),  MPCxxKxxE  (L2)  and  ExMxCxxGCxxG  (L3) 
in  [FeFe]-hydrogenase,  and  although  some  sequence  variation  does  occur  in  these  motifs,  the 
bolded  cysteine  residues  are  essential  for  coordination  of  the  H-cluster  [11].  In  addition  to 
variation  in  the  cluster  binding  motifs,  previous  studies  have  noted  substantial  sequence  variation 
in  the  motifs  that  contain  the  cysteine  residues  that  bind  the  N-  and  C-terminal  Fe-S  cluster  and 
accessory  cofactor  binding  domains  [13,  14]  (Fig.  2),  suggesting  potential  interactions  with  a 
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variety  of  redox  partners.  Due  to  the  substantial  N  and  C-tenninal  cluster  variation,  only  the  H- 
cluster  is  included  in  phylogenetic  reconstructions  of  the  catalytic  subunit,  which  is  termed 
HydA.  Although  the  branching  order  of  some  H-cluster  lineages  mirror  that  of  taxonomic 
reconstructions,  most  organisms  with  hydA  encode  multiple  copies  that  often  belong  to  divergent 
lineages,  implying  a  number  of  gene  duplications  and  horizontal  gene  transfers  during  the 
functional  diversification  of  this  protein  class.  As  an  example,  individual  clostridial  genomes 
each  encode  for  up  to  13  divergent  hydA  homologs,  suggesting  that  these  enzymes  have  unique 
functional  roles  in  these  organisms.  In  addition  to  variation  at  the  level  of  primary  sequence, 
disparities  in  the  number  of  [FeFe]-hydrogenase  subunits  and  their  tertiary  structure  have  been 
identified  [13,  14]. 

The  best  characterized  [FeFe]-hydrogenases  are  monomeric,  ferredoxin-dependent 
enzymes  [12].  Examples  include  the  hydrogenases  from  eukaryotic  algae,  such  as 
Chlamydomonas  reinhardtii,  and  clostridial  species,  such  as  Clostridium  pasteurianum.  C. 
reinhardtii  [FeFe]-hydrogenase  HydAl  (OHydA  1 )  contains  only  the  H-cluster  and  no  additional 
iron-sulfur  clusters  [14].  Like  bacteria,  algae  often  encode  multiple  homologs  of  [FeFe]- 
hydrogenase.  However,  unlike  bacteria,  homologs  in  a  given  algal  taxon  are  monophyletic 
relative  to  other  algal  or  bacterial  HydA,  implying  independent  gene  duplications  in  each  of  the 
algal  lineages  [28].  Given  the  structural  simplicity  of  the  C.  reinhardtii  hydrogenase  it  has 
become  the  model  for  active  site  maturation  studies  ([18,  29,  30]).  Phylogenetic  analyses  indicate 
that  the  H-cluster  domain  of  algal  [FeFe] -hydrogenase  is  not  likely  to  be  ancestral,  rather  it  is 
likely  that  the  ancestral  enzyme  had  a  single  ferredoxin  binding  domain  at  the  N-tenninus  which 
was  lost  when  these  genes  were  laterally  transferred  to  algae  [18].  The  simple  algal  enzyme 
containing  only  the  H-cluster  is  thought  to  function  during  the  fennentation  of  carbohydrate 
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reserves  [31,  32]  although  photosynthetic  H2  production  was  recently  demonstrated  under  fully 
aerobic  conditions  in  the  alga  Chlorella  vulgaris  [33]. 

C.  pasteurianum  contains  [FeFe]-hydrogenases  of  the  ferredoxin  (Fd)-dependent 
monomeric  (M3)  structural  category  [34]  (Fig.  2).  C.  pasteurianum  ferments  3  mol  glucose  to  2 
mol  acetate,  2  mol  butyrate,  4  mol  CO 2  and  8  mol  H2.  In  addition,  reduced  ferredoxin  (Fdred)  is 
generated  by  the  bifurcating  butyryl-CoA  dehydrogenase,  which  helps  to  explain  how  8  moles  of 
H2  are  produced  instead  of  the  6  mol  that  would  be  expected  based  on  thermodynamic 
considerations  [35].  The  Fd-dependent  [FeFe]  hydrogenases  are  widespread  in  anaerobic 
bacteria,  in  particular  in  the  order  Firmicutes  [11,  13].  More  complex,  multimeric  [FeFe]- 
hydrogenases  have  been  identified  in  Thermotoga  maritima,  Thermoanaerobacter  tengcongensis 
and  Desulfovibrio  fructosovorans  and  these  have  been  implicated  as  NAD(P)H-linked  H2 
producing  enzymes  [36-38].  The  trimeric  hydrogenase  of  T.  maritima  was  shown  not  to  couple 
H2  formation  with  the  oxidation  of  reduced  Fd  or  of  NADH  when  each  was  used  as  the  sole 
electron  donor.  However,  the  presence  of  both  reduced  Fd  and  NADH  promoted  efficient  H2 
production  [39].  The  key  to  the  fonnation  of  H2  from  Fd  and  NADH  simultaneously  is  the 
coupling  of  their  energetics;  the  exergonic  reduction  of  protons  by  electrons  derived  from  Fd 
allows  the  endergonic  reduction  of  protons  by  electrons  from  NADH.  This  type  of  reaction 
mechanism  is  called  electron  bifurcation  and  represents  the  third  form  of  energy  conservation 
[35,39], 

Energy  conservation  by  electron  bifurcation  was  first  proposed  in  2008  [40]  and  is 
defined  as  an  exergonic  reaction  driving  an  endergonic  reaction  without  the  involvement  of  an 
ion  gradient.  The  clostridial  cytoplasmic  enzyme  complex  butyryl-CoA  dehydrogenase  catalyzes 
the  reduction  of  crotonyl-CoA  (E0’  =  -10  mV)  to  butyryl-CoA  with  NADH  (E0’  =  -320  mV)  as 
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the  electron  donor  and  this  is  coupled  to  the  endergonic  reduction  of  Fd  (Em  =  -410  mV)  by 
NADH  [40,  41].  Reduced  Fd  is  then  recycled  to  produce  H2  (E0’  =  -420  mV).  As  a  consequence 
of  this  pair  of  linked  redox  reactions,  the  oxidation  of  NADH  can  be  coupled  to  H2  production 
and  this  ultimately  leads  to  the  fonnation  of  additional  ATP.  The  discovery  of  electron 
bifurcation  can  explain  important  reactions  in  anaerobic  microbes,  such  as  in  clostridia  and 
methanogens  [42], 

The  heterotrimeric  [FeFe]-hydrogenase  from  T.  maritima  (Topt  80°C)  utilizes  the 
exergonic  oxidation  of  Fd  [(Em  =  -453  mV  (at  80°C),  [43,  44]]  to  drive  the  unfavorable  oxidation 
of  NADH  (E°’  =  -320  mV)  to  produce  H2  (E0’  =  -420  mV).  The  overall  reaction  is: 

NADH  +  2Fdred  +  3H+  -»  2H2  +  NAD+  +  2  Fdox 

This  bifurcating  hydrogenase  provided  new  insights  into  H2  production  by  anaerobes.  For 
example,  Caldicellusiruptor  and  other  Thermoanaerobacteriales  contain  two  types  of 
hydrogenase,  a  soluble  multimeric  [FeFe] -hydrogenase  similar  to  the  T.  maritima  bifurcating 
hydrogenase  and  a  group  4  membrane  bound  Fd-dependent  [NiFe]-hydrogenase.  These  enzymes 
were  proposed  to  function  together  to  evolve  H2  during  sugar  metabolism  [37].  However,  genetic 
analyses  of  T.  saccharolyticum  showed  that  the  membrane  bound  [NiFe] -hydrogenase  appeared 
not  to  be  involved  in  H2  production  while  deletion  of  the  multimeric  [FeFe]-hydrogenase 
resulted  in  decreased  H2  production  [45].  This  indicates  that  these  organisms  also  utilize  a 
bifurcating  [FeFe] -hydrogenase  for  H2  production  and  the  non-bifurcating  [NiFe]-hydrogenase 
could  have  a  secondary  metabolic  role. 

Bifurcating  [FeFe]-hydrogenases  also  play  a  central  role  in  energy  conservation  in 
acetogens  [46],  Acetobacterium  woodii  uses  a  respiratory  system  for  electron  transfer  between 
Fd  and  NAD  as  the  main  energy-conserving  mechanism  for  autotrophic  growth  on  H2  and  carbon 
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dioxide  [47,  48].  However,  the  mechanism  for  how  reduced  Fd  is  generated  from  H2  only 
became  clear  through  the  characterization  of  its  bifurcating  [FeFe]-hydrogenase,  which  functions 
to  couple  H2  oxidation  to  the  simultaneous  production  of  reduced  Fd  and  NADH  [46]. 

Bifurcating  [FeFe]-hydrogenases  can  in  principle  function  in  vivo  in  both  the  production 
and  oxidation  of  H2,  as  was  recently  shown  for  a  reversible  bifurcating  hydrogenase  in  Moorella 
thermoacetica  [49].  During  growth  on  glucose,  M.  thermoacetica  can  intennittingly  produce  H2 
from  Fd  and  NADH  via  the  bifurcating  hydrogenase.  Likewise,  this  same  hydrogenase  can 
utilize  produced  H2  and  C02  to  perform  acetogenesis  generating  3  mol  of  acetate  per  glucose 
molecule.  Although  M.  thermoacetica  does  not  grow  well  autotrophically  on  H2  and  C02,  it  uses 
the  bifurcating  hydrogenase  to  produce  reduced  Fd  to  drive  acetogenesis.  In  addition  to  Fd-  and 
NADH-dependent  hydrogenases,  a  Fd-  and  NADPH-dependent  enzyme  was  characterized  from 
Clostridium  autoethanogenum  cultured  on  CO  and  it  was  proposed  that  this  hydrogenase 
functions  in  the  formation  of  H2  [50].  Moreover,  the  genome  of  C.  autoethanogenum  encodes  for 
two  additional  bifurcating  [FeFe] -hydrogenases  that  could  function  in  providing  reduced  Fd  and 
NADH  during  autotrophic  growth  on  H2  and  C02  [51].  Bifurcating  hydrogenases  might  also  play 
an  important  role  in  the  gut  microbiota.  Ruminoccoccus  species,  which  are  involved  in  the 
degradation  of  recalcitrant  biomass,  contain  [FeFe]-hydrogenases  very  similar  to  the  bifurcating 
hydrogenase  of  T.  maritima  [52].  Anaerobic  gut-dwelling  protists  and  fungi  involved  in  biomass 
degradation  also  encode  [FeFe]-hydrogenases  to  recycle  their  electron  carriers  that  bear  a 
similarity  to  the  bifurcating  [FeFe]-hydrogenases  of  anaerobic  bacteria  [53-55] 
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Phylogenetic  and  Functional  Diversity  of  [NiFe]-hydrogenases 

[NiFe]-hydrogenases  from  a  variety  of  microorganisms  have  been  biochemically 
characterized  [56,  57].  These  enzymes  are  comprised  of  at  least  two  subunits,  with  the  large 
subunit  (LSU)  containing  the  NiFe-catalytic  site  and  the  small  subunit  (SSU)  containing  three 
highly  conserved  iron-sulfur  clusters  that  serve  to  shuttle  electrons  between  the  external  electron 
carrier  and  the  NiFe  site.  Based  on  comparison  of  the  sequence  motifs  that  coordinate  the  active 
site,  [NiFe]-hydrogenases  can  be  separated  into  four  distinct  functional  groups  that  in  general 
correspond  to  their  putative  physiological  role  [12].  Intriguingly,  phylogenetic  reconstructions  of 
representatives  of  these  enzymes  also  reveal  patterns  of  clustering  that  correspond  to  variations 
in  NiFe  cluster  binding  motifs  and  their  inferred  physiological  role  [12,  13]  (Fig.  3). 

Group  1 

Enzymes  within  this  group  are  typically  membrane -bound  hydrogenases  and  are  found  in 
organisms  that  use  FE  as  an  energy  source.  A  cytochrome  b  is  often  also  present  to  anchor  the 
hydrogenase  complex  to  the  membrane  and  allow  electron  transfer  to  the  quinone  pool.  These 
enzymes  couple  the  oxidation  of  FE  to  the  reduction  of  electron  acceptors  such  as  oxygen, 
nitrate,  sulfate,  CCE  or  oxidized  sulfur  compounds  [12].  E.  co/i  contains  2  homologs  of  this 
group  and  one  of  these  (Hya)  is  thought  to  oxidize  H?  and  transfer  electrons  to  the  quinone  pool 
to  provide  the  cell  with  additional  energy  [58].  The  sulfate  reducing  bacterium  Desulfovibrio 
gigas  encodes  two  [NiFe]-hydrogenases.  One  is  a  membrane  bound  Ech-type  group  4  enzyme 
while  the  other  is  a  periplasmic  group  1  enzyme  (HynAB).  By  mutational  analysis  it  was  shown 
that  the  periplasmic  hydrogenase  is  essential  for  growth  on  FE  with  sulfate  and  functions  to 
donate  electrons  from  FE  to  the  quinone  pool  via  a  periplasmic  cytochrome  c  [59].  Aquifex 
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aolicus  is  one  the  most  thennophilic  bacteria  known  and  it  gains  energy  by  oxidizing  H2  using 
low  concentrations  of  oxygen  or  oxidized  sulfur  compounds  as  electron  acceptors  [60],  Its 
genome  encodes  two  trimeric  uptake  (group  1)  [NiFe]-hydrogenases  (Mbhl  and  Mbhll)  and 
these  have  distinct  physiological  functions  [61].  Mbhl  is  involved  in  the  oxidation  of  H2  linked  to 
oxygen  reduction  while  H2  oxidation  by  Mbhll  is  coupled  to  elemental  sulfur  reduction  [62].  A 
similar  H2-oxidizing  and  elemental  sulfur  reducing  respiratory  system  was  described  in  the 
archaeon  Acidianus  ambiva/ens  [63]  but  its  genome  sequence  is  not  available  and  so  the  diversity 
of  its  hydrogenases  is  currently  not  resolved. 

Group  2 

Group  2a:  This  subgroup  includes  hydrogenases  of  cyanobacteria  that  fix  dinitrogen  (N2)  using 
nitrogenase.  These  organisms  contain  a  dimeric  uptake  hydrogenase  (HupSL)  to  recapture 
energy  lost  as  Tb  during  the  reduction  of  No  [64].  Electrons  from  H2  are  then  channeled  into  the 
quinone  pool  or  function  to  reduce  02  that  deactivates  the  oxygen  sensitive  nitrogenase  enzyme 
[65,  66].  Mutation  of  the  HupLS  genes  in  the  diazotroph  Anabaena  siamensis  abolished  H2 
uptake  activity  and  led  to  an  increase  in  light-dependent  H2  production,  presumably  due  to  the 
lack  of  recapture  of  H2  produced  by  nitrogenase  enzyme  in  the  absence  of  the  uptake 
hydrogenase  [67]. 

Group  2b:  Group  2b  consists  of  H2-sensing  regulatory  hydrogenases  that  function  to  signal  the 
availability  of  H?  to  the  transcriptional  regulation  of  metabolic  hydrogenases,  which  themselves 
are  most  often  affiliated  with  Group  1  H2  oxidizing  enzymes  [11].  A  very  well  studied  model  is 
the  H2  oxidizing  aerobe  Rcilstonia  eutropha,  which  contains  three  different  hydrogenases.  In  this 
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case  its  H2  sensing  hydrogenase  HoxBC  regulates  the  expression  of  the  membrane  bound 
(HoxKGZ)  and  the  cytoplasmic  NAD-dependent  (HoxFUYH)  enzymes  [68]. 

Group  3 

Group  3a:  The  F42o  reducing  hydrogenases  (Frh)  are  exclusively  found  in  methanogenic  archaea 
where  they  function  in  coupling  the  oxidation  of  FF  to  the  reduction  of  cofactor  F420,  thereby 
supplying  reductant  for  two  steps  in  the  reduction  of  C02  to  methane  [11].  These  enzymes 
consist  of  a  large  subunit  (FrhA),  a  small  subunit  (FrhG),  and  a  FAD  containing-subunit  (FrhB) 
for  electron  transfer  to  F42o  [69,  70]. 

Group  3b:  Group  3b  enzymes  are  tetrameric  (a(3y5)  and  are  found  primarily  in  thermophilic 
archaea.  Their  a  and  5  subunits  represent  the  minimal  [NiFe]-hydrogenase  structure,  with  the 
two  other  subunits  (P  and  y)  containing  iron-sulfur  clusters  and  a  NAD(P)/FAD  binding  domain 
[12].  The  enzyme  from  the  hyperthermophilic  archaeon  Pyrococcus  furiosus  has  been  well 
characterized  and  shown  to  function  as  an  uptake  hydrogenase  to  provide  NADPH  for 
biosynthesis,  although  mutational  analysis  has  shown  that  it  is  not  required  for  growth  or  for 
supplying  NADPH  to  other  metabolic  processes  under  usual  laboratory  growth  conditions  [71- 
73], 

Group  3c:  Some  archaea,  primarily  methanogens,  encode  a  trimeric  methylviologen  reducing 
hydrogenase  (MvhAGD).  MvhAGD  fonns  a  complex  with  heterodisulfide  dehydrogenase 
(HdrABC)  that  in  turn  provides  the  physiological  electron  acceptor  heterodisulfide  CoM-S-S- 
CoB  [74].  With  the  discovery  of  electron  bifurcation  it  became  clear  that  the 
MvhADG/HdrABC  complex  catalyzes  the  reduction  of  heterodisulfide  and  Fd  simultaneously 
with  H2  as  reductant  at  high  specific  activity,  while  in  the  absence  of  Fd  only  low  rates  are 
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observed  [75].  The  complex  thus  appears  to  couple  the  endergonic  reduction  of  Fd  (E’  =-500 
mV)  to  the  exergonic  reduction  of  CoM-S-S-CoB  (E’o=  -140  mV)  with  H?  (E’o=  -414  mV)  as 
the  electron  donor,  providing  a  putative  mechanism  by  which  methanogens  conserve  energy  [42, 
76], 

Group  3d:  This  subgroup  consists  of  bidirectional  heteromultimeric  [NiFe]-hydrogenases 
(HoxHY)  that  are  associated  with  an  additional  NADH  oxidoreductase  (diaphorase)  module, 
which  shows  distinct  homology  to  the  NADH  input  module  of  Complex  I  [12].  The  bidirectional 
hydrogenases  are  found  in  aerobic  H2-utilizing  organisms  such  as  Ralstonia  eutropha,  in  which 
they  most  likely  catalyze  H2  oxidation  and  supply  reducing  equivalents  (NADH)  to  Complex  I 
for  energy  generation  or  provide  reductant  for  biosynthesis  [77].  These  bidirectional 
hydrogenases  are  also  found  in  many  cyanobacteria  where  they  function  to  dispose  of  excess 
electrons  derived  from  fennentation  and  photosynthesis  [65].  In  the  cyanobacterium 
Synechocystis  spp.  PCC  6803  it  was  shown  that  the  bidirectional  hydrogenase  could  also  directly 
accept  electrons  from  Fd  or  flavodoxin,  which  explains  the  production  of  H2  by  over-reduction 
of  the  Fd  pool  in  the  light  or  fermentative  metabolism  under  anoxic  dark  conditions  [78], 

Group  4 

Group  4  [NiFe]-hydrogenases  are  all  membrane-bound  and  are  mostly  Fd-dependent  enzymes. 
These  enzymes  cluster  distinctly  from  all  the  other  hydrogenases  indicating  that  they  have  a 
separate  evolutionary  history  [11].  The  members  of  this  group  fall  into  3  major  subgroups:  1)  six 
subunit  reversible  ion-translocating  enzymes  termed  Ech  (energy  converting  hydrogenase), 
which  are  found  in  various  orders  of  bacteria  and  a  few  archaea  [20],  2)  a  thirteen  (or  even  more) 
subunit  membrane  bound  and  ion-translocating  enzyme  (Mbh)  with  subunits  homologous  to 
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Na+/H+  antiporters  (Mrp)  and  these  Mrp-Mbh  enzymes  are  common  in  the  thennophilic  archaea 
[79],  and  3)  a  variant  of  the  Mrp-Mbh  (Eha /  Ehb)  present  in  hydrogeno trophic  methanogens  that 
contains  additional  subunits  (17-20  subunits)  and  uses  an  ion  gradient  to  reduce  Fd  with  H2  [80]. 
The  group  4  hydrogenases  also  include  a  number  of  distinct  H2-evolving  multienzyme 
complexes  such  as  the  formate  hydrogen  lyase  of  E.  coli  and  Thermococcus  onnurineus  that 
oxidizes  formate  and  evolves  H2  [81,  82],  and  the  CO-induced  hydrogenases  of  T.  onnurineus 
and  Carboxydothermus  hydrogenof ormans,  which  are  involved  in  generating  energy  from  the 
oxidation  of  CO  to  C02  coupled  with  the  production  of  H2  [83,  84].  The  six-subunit  energy 
converting  hydrogenase  (EchA-F)  was  originally  characterized  in  Methanosarcina  barkeri  where 
it  was  found  to  be  required  for  its  growth  on  acetate  [85].  This  hydrogenase  can  reversibly 
generate  H?  from  reduced  Fd  with  the  concomitant  generation/utilization  of  a  proton  gradient; 
homologs  of  this  enzyme  are  found  among  anaerobic  bacteria  and  few  archaea.  The  Mrp-Mbh- 
type  energy-converting  H2-producing  enzymes  were  first  described  in  the  hyperthermophilic 
archaeon  P.  furiosus  [79,  86,  87].  The  striking  difference  between  Mrp-Mbh  and  Ech  is  the 
presence  of  the  Mrp-like  H+/Na+  antiporter  module  in  the  Mrp-Mbh  complex  and  the  subsequent 
generation  of  a  sodium  gradient  rather  than  a  proton  gradient.  This  Fd-dependent  enzyme  is 
linked  to  the  glycolytic  pathway  found  in  hetero trophic  archaea  that  is  unique  in  that  Fd  is  the 
only  electron  carrier  (NADH  is  not  produced).  This  allows  for  the  efficient  production  of  4 
moles  of  H2  per  mole  of  glucose  [88]  The  energy  converting  hydrogenases  of  hydrogenotrophic 
methanogens  (Eha  and  Ehb),  are  thought  to  provide  reduced  Fd  for  biosynthesis  and  to  balance 
the  first  step  of  methanogenesis,  the  Fd-dependent  reduction  of  C02  to  fonn  fonnylmethanofuran 
[80,  89], 
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The  group  4  hydrogenases  also  contain  a  subunit  core  that  has  close  homologs  in  three 
non-hydrogenase  enzyme  complexes.  These  are  tenned  cofactor  F42o  oxidoreductase  (Fpo)  found 
in  methanogenic  archaea,  a  membrane  oxidoreductase  (Mbx)  involved  in  sulfur  reduction  by 
hyperthermophilic  archaea  such  as  P.  furiosus,  and  NADH  quinone  oxidoreductase  (Complex  I, 
Nuo)  found  in  aerobic  bacteria  and  eukaryotes  [21,  39].  In  particular,  the  close  relationship 
between  group  4  hydrogenases  and  components  of  Complex  I  of  the  ubiquitous  aerobic 
respiratory  chain  indicates  an  intricate  evolutionary  history  between  H2  and  oxygen  respiration 
[21,39]. 

Group  5 

Recently  a  group  5  [NiFe]-hydrogenase  was  proposed  based  on  genome  sequence,  phylogenetic 
reconstructions,  and  biochemical  characterization  of  an  enzyme  from  Streptomyces  spp.  that  is 
capable  of  oxidizing  very  low  (<1  ppm)  levels  of  H2  [90,  91].  This  was  corroborated  with  the 
characterization  of  several  Streptomyces  soil  isolates  [92].  These  group  5  hydrogenases  are 
similar  in  structure  to  the  group  1  enzymes  in  that  they  also  consist  of  a  small  and  large  subunit 
but  a  potential  cytochrome  containing  membrane  anchor  is  not  obvious.  Thus,  while  appearing 
to  be  biochemically  similar  to  group  1  H2-oxidizing  enzymes,  the  group  5  enzymes  are  distinct 
from  the  group  1  enzymes  and  their  [NiFe]  binding  motifs  are  different  [90]. 

Hydrogenase  Mechanism 
[NiF  e(Se)]  -hydrogenases 

Catalytic  site  structure,  coordination  sphere  and  H2  activation  mechanism 
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X-ray  structures  of  both  [NiFe]-  and  [NiFeSe]-hydrogenases  poised  in  various  active  and 
inactive  states  have  been  collected  and  these  have  provided  detailed  insights  into  cofactor 
arrangements  and  coordination  environments  [9,  93-101],  The  bimetallic  cofactor  of  [NiFe(Se)] 
hydrogenase  is  composed  of  Ni  and  Fe  atoms  bridged  by  a  pair  of  Cys  thiolates.  Two  CN-  and 
one  CO  complete  the  coordination  of  the  iron  ion  to  create  a  subsite  resembling  the  2Fe 
subcluster  of  the  [FeFe] -hydrogenase  H  cluster  (Fig.  4).  A  second  pair  of  Cys  thiolates  completes 
the  coordination  of  the  Ni  atom,  one  of  which  is  replaced  by  selenocysteine  in  [NiFe(Se)] 
hydrogenases.  Surrounding  the  Ni-Fe  cluster  are  conserved  Arg,  His  and  Glu  residues,  the 
functional  groups  of  which  create  a  network  of  exchangeable  sites  near  or  within  H-bonding 
distance  to  the  Ni-Fe  cluster  illustrated  in  Fig.  4A.  Mutagenesis  of  the  nearby  Arg,  Glu  and  His 
residues  leads  to  loss  or  attenuation  of  catalytic  activity,  consistent  with  their  proposed  function 
in  proton  transfer  (PT)  and/or  H-bonding  interactions  with  the  [NiFe]  cluster  [102,  103],  In 
addition,  electron  nuclear  double  resonance  (ENDOR)  spectroscopy  and  high-resolution 
crystallography  have  shown  that  the  Se/S  groups  that  coordinate  Ni  can  function  as  a  base  to 
accept  protons  during  PT  and  H2  activation  [104], 


[NiFe(Se)-hydrogenase  oxidation  states  under  H 2  activation 

Detailed  summaries  and  overview  of  the  various  crystallographic,  paramagnetic  (e.g., 
EPR,  ENDOR),  and  infrared  (IR)  spectroscopic  properties  of  catalytic  intermediates  of 
[NiFe(Se)]-hydrogenases  have  recently  been  extensively  reviewed  [10,  104-106],  There  are 
several  reasonably  defined  catalytic  states  observed  in  the  different  classes  of  [NiFe(Se)]- 
hydrogenases  (Table  1).  Notwithstanding  the  significant  experimental  and  theoretical  studies 
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applied  to  the  mechanism  of  [NiFe(Se)]-hydrogenases,  definitive  experimental  evidence  for  the 
site  of  H2  binding  is  lacking,  though  most  models  support  Ni  in  this  function  [89,  90].  Reaction 
intermediates  isolated  under  catalytic  conditions  have  been  modeled  into  a  mechanism  of 
activation,  but  many  of  the  details  are  unresolved  [104,  105,  107,  108]. 


Table  1.  States  identified  in  [NiFe(Se)]-hydrogenases  under  H2. 


Catalytic 

State 

Oxidation 

State 

EPR 

(g-values) 

FTIR 

(cm1) 

Ligand 

Ni 

Fe 

vCO 

vCN 

aNi-SI 

II 

II 

Silent 

1943 

2075,  2086 

h2o 

Ni-C 

III 

II 

2.21,2.15,2.01 

1961 

2074,  2085 

H" 

Ni-R 

II 

II 

Silent 

1948 

2061,2074 

H 

bNi-L 

I 

II 

2.28,2.11,2.05 

1911 

2048,  2062 

H+ 

aNi-TR  for 

[NiFeSe]  hydrogenases. 

h  Photoproduct  observed  after  illumination  at  <200K. 


In  the  resting  state  of  [NiFe(Se)]-hydrogenases,  Ni-SI,  the  Fe  center  lies  within  id- 
bonding  distance  of  the  Ni,  whereby  H2  binding  and  heterolytic  cleavage  at  Ni11  would  be 
facilitated  by  PT  to  result  in  a  p-hydride  intennediate  (Fig.  5)  [109].  Alternatively,  experimental 
and  theoretical  models  propose  initial  H2  binding  and  activation  can  occur  at  the  Fe  site,  when 
the  Ni  is  modeled  as  high  spin  Ni11  [107,  108,  110].  Cleavage  is  proposed  to  result  in  PT  to  one 
of  the  coordinating  thiolate  or  selenate  groups,  and  subsequently  to  nearby  conserved  Glu  [102]. 
The  resulting  hydride  is  bound  to  the  Ni-C  (or  Ni-R)  intermediate  and  based  on  spectroscopic 
analysis  together  with  density  functional  theory  (DFT)  calculations,  it  is  proposed  to  adopt  a 
bridging  conformation  between  the  Ni  and  Fe  atoms  [109,  ill,  112],  Much  of  the  detail  on 
reaction  intermediates  of  [NiFe]-hydrogenases  have  relied  on  studies  of  a  subset  of  the 
[NiFe(Se)]-hydrogenases.  Thus,  it  is  quite  possible  for  unique  aspects  to  emerge  from  future 
studies  of  a  more  diverse  array  of  enzymes. 
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[F  eF  e]  -hydrogenases 


Catalytic  site  structure  and  coordination  sphere 

Two  x-ray  crystal  structures  of  [FeFe]-hydrogenases  from  Clostridium  pasteurianum  (Cp\) 
and  Desu/fovibrio  desulfuricans  ( DdR )  revealed  the  active  site  H  cluster  (Fig.  4B),  a  unique 
[6Fe-6S]  organometallic  cluster  comprised  of  a  [4Fe-4S]  subcluster  and  2Fe  subcluster,  which 
are  linked  through  a  protein  cysteine  thiolate  ligand  [113,  114].  The  geometric  and  electronic 
properties  of  the  2Fe  subcluster  are  closely  connected  to  the  [4Fe-4S]  subcluster  by  strong  spin 
coupling  exchange  through  the  Cys  thiolate  ligand  between  the  two  subclusters  [115-120], 

Reduction  of  protons  to  H2  occurs  at  the  2Fe  subcluster  site,  which  includes  a  proximal  Fe 
atom  (Fep)  and  distal  Fe  atom  (Fed)  (in  relation  to  the  [4Fe-4S]  cubane),  each  bound  to  a  terminal 
CO  and  CN-  ligand,  bridging/semi-bridging  CO  ligand,  and  a  bridging  dithiolate  ligand.  A  ligand 
exchangeable  coordination  site  also  resides  at  Fca  and  is  a  potential  site  for  the  formation  of 
hydride  intermediates  [121],  The  CO  and  CN-  ligands  promote  reversible  hetero lytic  H2  cleavage 
and  stabilize  low  spin  states  of  the  two  Fe  atoms  of  the  subcluster  [122,  123].  The  CN-  ligands 
further  fine-tune  the  energy  levels  of  the  frontier  orbitals  of  the  [4Fe-4S]  and  2Fe  subcluster 
moieties,  which  helps  make  possible  fast  electron  transfer  (ET)  between  the  two  subclusters 
during  catalysis  [124].  The  functional  bridgehead  group  of  the  dithiolate  ligand  was  recently 
confirmed  as  an  amine  [125,  126],  an  assignment  previously  suggested  from  analysis  of  DdR 
[127]  and  advanced  EPR  methods  [115],  and  theoretical  models  to  function  in  PT  to  Fed  during 
H2  activation  [128]. 

In  the  enzyme,  the  H  cluster  resides  in  a  hydrophobic  pocket  where  nearby  residues 
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participate  in  H-bonding  interactions  to  the  diatomic  ligands  of  the  2Fe  subcluster  [113,  114, 
129].  The  surrounding  protein  framework  (inely-tuncs  the  H-cluster  and  is  considered  to  play  an 
important  role  in  regulating  its  catalytic  activity,  electronic  properties,  and  potential  hydride 
binding  sites  [8,  118,  130-134].  Within  the  catalytic  site  are  several  conserved,  charged  residues 
that  fonn  the  secondary  coordination  sphere,  with  exchangeable  groups  proposed  to  function  in 
the  transfer  of  protons,  water  coordination,  and  bonding  interactions  with  the  H  cluster  (Fig.  4B) 
[8,  121,  129,  133,  135], 


Table  2.  Redox  states  identified  in  [FeFe]-hydrogenases  under  H2. 


Catalytic 

State 

Oxidation  State 

EPR 

(g-values) 

FTIR  (cm'1) 

Fed 

Terminal 

Ligand 

2Fe 

[4Fe-4S] 

Fe„ 

Fed 

vCO 

vCN 

aH0X 

II 

I 

2+ 

2.10,2.04, 

1.99 

1964,  1940, 
1802 

2089,  2071 

h2o 

bH0X 

I 

II 

2+ 

2.10,2.04, 

1.99 

1964,  1940, 
1800 

2088,  2072 

h2o 

cHred 

I 

I 

2+ 

Silent 

1916,  1891, 
1792 

2038,  2034 

H+ 

_aHred 

I 

I 

2+ 

Silent 

1935, 1891, 
1793 

2083,  2070 

Open  (H) 

eHred 

I 

I 

2+ 

Silent 

1965,  1916, 
1894 

2079,  2041 

H+ 

Hsred 

I 

I 

1+ 

ND 

1955,  1932, 
1883 

NA 

NA 

gHsred 

I 

I 

1+ 

2.08,  1.94, 

1.88 

1954,  1919, 
1882 

2070,  2026 

H+ 

hHsred 

I 

I 

1+ 

Broad  2.3- 
2.07 

1933,  1883,? 

2085,  ? 

H+ 

a  Based  on  [136-14 

10]. b  Based  on  [1 15,  141,  142], c  Based  on  [139],  d  1 

Based  on  [8,  143-145]. e 

Based  on  [123,  140,  143-145]. f Based  on  [140].  g  Based  on  [143,  144],  h  Based  on  [139],  “?” 
signifies  unresolved  assignments  for  CO/CN  modes. 
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The  emerging  role  of  [4Fe-4S]  in  the  [FeFe]-hydrogenase  catalytic  cycle 


Recent  spectroscopic  investigations  of  the  minimal  algal  [FeFe]-hydrogenase  CrHydA, 
have  put  focus  on  the  [4Fe-4S]  and  its  role  as  an  ET  relay  during  catalysis  (Fig.  6,  and  Table  2). 
Initial  spectroelectrochemical  FTIR  and  EPR  investigations  on  D(FA  led  to  identification  of  a 
“super-reduced”  [140],  or  Hsred,  H  cluster  state,  which  was  later  confirmed  in  CrHydA  and 
assigned  as  a  [4Fc-4S]  '  -Fc'Fc1  [141,  146].  Further  EPR  and  FTIR  studies  of  CrHydA  under 
reducing  conditions  provided  evidence  for  multiple  H  cluster  intermediates  containing  a  reduced 
[4Fc-4S] 1 1  during  both  catalytic  H2  activation  and  proton  reduction.  This  implicated  a  role  as  an 
electron  mediator  between  2Fe  and  Fd  [147].  X-ray  spectroscopy  along  with  DFT  on  CrHydA 
found  that  for  a  complete  model  of  the  H-cluster,  the  LUMO  resides  over  the  [4Fe-4S]  subcluster 
and  is  close  in  energy  to  the  HOMO  localized  on  Fed  of  2Fe,  further  supporting  its  role  as  an 
initial  relay  into  the  2Fe  subcluster  during  H2  activation  [148]. 

These  fonns  of  H-cluster  reduced  states  with  [4Fe-4S]1+  have  led  to  revisions  [141,  147] 
of  earlier  models  for  H2  activation  by  [FeFe]-hydrogenases  [149].  Formally,  H2  activation  was 
principally  based  on  PT/ET  transitions  between  the  Hox  ([4Fe-4S]2+-FeIIFeI)  and  Hred  ([4Fe- 
4S]2+-FeIFeI)  states.  These  two  redox  states  of  the  H  cluster  were  assigned  from  early 
spectroscopic  studies  of  more  complex  [FeFe]-hydrogenases  containing  additional  FeS  cofactors, 
or  “F  clusters”  [150-153].  Thus,  it  is  possible  that  trapping  of  [4Fe-4S]*  intermediates  in  these 
enzymes  was  prevented  by  ET  between  the  H  cluster  and  F  clusters  during  H2  activation. 
Alternatively,  it  is  possible  that  some  of  these  intermediates  that  have  been  observed  in 
CrHydA  1  might  be  unique  to  this  enzyme  due  to  its  lack  of  F  clusters. 

The  role  of  Hsred  as  a  catalytic  intermediate  has  come  under  question  recently  based  on 
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protein  film  voltammetry  (PFV)  studies  of  CrHydA.  In  one  case,  CrHydA  was  observed  to 
inactivate  at  low  potentials,  leading  to  the  hypothesis  that  Hsreci  might  not  be  catalytically  relevant 
[140,  154].  This  observation  differs  from  the  high  catalytic  currents  of  enzymes  at  the  low 
reduction  potentials  (<500  mV  vs.  NHE)  [141,  155,  156],  conditions  that  enrich  for  the  Hsred  state 
in  spectroelectrochemical  FTIR  [144],  Clearly,  identification  of  a  [4Fe-4S]’  in  CrHydA  under 
different  reducing  conditions  indicates  that  oxidation  of  H2  is  a  tightly  coupled  two-electron, 
two-proton  reaction,  and  facile  intramolecular  ET  steps  to  F  clusters  of  more  complex  [FeFe] 
hydrogenases  likely  prevented  the  earlier  detection  of  a  [4Fe-4S]1+  oxidation  state.  Mutants 
where  PT  is  blocked  or  disrupted  could  be  a  useful  strategy  towards  capturing  reduced  states  and 
transient  intermediates  during  catalysis  [149]. 


H 2  binding  and  activation  at  the  H  cluster;  activation  at  FeJ 1 

The  resting  state  of  the  H  clusters  in  C/4,  DdH  and  CrHydA  are  formally  assigned  to  a 
mixed-valent  2Fe  subcluster  paired  to  a  diamagnetic  [4Fe-4S]2+  subcluster  based  on  the 
collective  EPR/FTIR  spectra  of  oxidized  states  (Hox)  of  various  enzymes  (Table  2).  Oxidation 
state  assignments  of  the  Fed  /  Fep  pair  in  the  Hox  state,  and  the  localization  of  the  unpaired  spin, 
are  currently  under  debate  [115,  120,  157]  (Fig.  6).  Nevertheless,  there  is  agreement  that  the 
initial  step  of  H2  binding  occurs  at  the  open  coordination  site  of  Fed  in  the  Fe11  oxidation  state. 
Evidence  in  support  of  this  comes  from  spectroscopic  studies  on  57Fe  labeled  DdH  poised  in  Hox 
and  CO-inhibited  states  [115].  The  effect  of  CO  on  Hox  was  used  to  interpret  the  effect  of  H2 
binding  on  the  H  cluster  electronic  state.  CO  binding  induced  stronger  couplings  between  the 
[4Fe-4S]  cubane  and  the  2Fe  subcluster,  consistent  with  a  shift  of  the  unpaired  spin  from  the  2Fe 
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subcluster  toward  [4Fe-4S]  (e.g.,  electron  exchange  from  Fed-^Fep).  More  recently  pulse  EPR 
studies  on  isotopically  labeled  H  cluster  with  13C15N  ligands  support  assignment  of  the  unpaired 
spin  in  the  Hox  state  to  Fed  and  delocalization  of  the  spin  over  both  Fe  ions  upon  CO  binding,  a 
picture  also  consistent  with  a  shift  of  unpaired  spin  toward  [4Fe-4S]  [157].  A  completely 
synthetic  H  cluster  mimic  has  also  been  shown  to  proceed  via  a  Fc'-^Fc"  transition  of  a 
diiron/dithiolate  subsite  to  accomplish  H2  binding  and  activation  [158].  In  this  example  of  bio¬ 
inspired  chemistry,  the  diferrous  state  was  induced  via  oxidation  by  a  covalently  attached 
ferrocene  (e.g.,  [4Fe-4S]2+).  The  intramolecular  ET  step  led  to  binding  and  activation  of  H2  at  the 
open  coordination  site  of  the  Fed11  site,  similar  to  the  mechanism  based  on  enzyme  studies  as 
proposed  in  Fig.  6,  Scheme  B. 


Common  functional  themes 

Catalytic  bias  and  electron-transfer  reactions 

Under  electrochemical  PFV  or  biochemical  assays  with  redox  dyes  of  varying  potentials, 
different  hydrogenases  have  been  shown  to  possess  differences  in  catalytic  bias,  or  reaction 
directionality  [10,  159,  160].  The  degree  of  reaction  bias,  or  extent  that  H2  production  or 
oxidation  is  favored,  is  intrinsic  to  each  particular  hydrogenase  [159,  161-163].  This  property  has 
implications  in  the  function  of  hydrogenase  in  a  metabolic  network,  which  is  how  the  H2 
activation  step  is  coupled  to  specific  redox  partners.  Most  or  all  members  of  each  enzyme  class 
have  been  shown  by  PFV  to  operate  near  to  the  thermodynamic  potential  of  the  H?/2H  redox 
couple.  Structural  factors  that  control  enzyme  reaction  bias,  and  to  what  degree  these  are  enzyme 
specific,  remain  ongoing  areas  of  investigation.  Several  hypotheses  have  been  proposed  for  how 
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bias  might  be  controlled  based  on  differences  in  the  enzyme  structures.  One  model  for  bias 
control  is  based  on  the  differences  in  the  potential  of  the  electrons  entering  the  catalytic  site  [162, 
164].  Thus,  the  evolution  of  enzyme  and  accessory  cluster  diversity  has  a  significant  role  in 
tuning  the  catalytic  function  of  the  particular  hydrogenase  to  couple  to  a  given  metabolic 
pathway.  Other  possible  controlling  factors  of  reaction  bias  are  proton  availability  [8],  midpoint 
potentials  of  the  le'  oxidized/reduced  forms  of  bimetallic  clusters,  role  of  secondary  sphere 
coordination  environment  [165]  and  the  accessibility  of  metal  oxidation  states  and  electronic 
structures  [160]. 


Low  spin,  low-valent  Fe(CO)(CN)  centers  and  H ?  activation 

A  functional  theme  that  has  emerged  from  hydrogenase  active  site  theoretical  and  model 
compound  studies  is  the  role  of  a  square  planar  FeI/n(CO)(CN)  center  in  enzymes  that  function  in 
H2  binding  and  activation.  The  functional  importance  of  the  Fe  site  in  [NiFe]-hydrogenases  is 
perhaps  less  obvious  than  for  the  [FeFe]-hydrogenases  where  FF  binding  and  activation  occur 
directly  at  a  Fe(CO)(CN)  subsite.  For  both  enzyme  families,  inclusion  of  this  organometallic 
functional  group  as  a  component  of  the  bimetallic  catalytic  sites  is  likely  required  to  keep  the 
respective  metal  centers  in  low-spin,  low  valence  states,  where  the  reaction  coordinate  for  FF 
activation  is  thermodynamically  favored  [166-168], 


Maturation 

[NiFe] -Hydrogenase  Active  Site  Assembly 
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Six  hyp  genes  denoted  hypA  -  hypF  (and  their  associated  homologs)  are  required  for 
biosynthesis  and  insertion  of  the  NiFe(CN)2CO  catalytic  cluster  into  the  large  subunit  of  [NiFe]- 
hydrogenases. 

The  Synthesis  of  Thiocyanate:  HypE  and  HypF 

HypC,  D,  E,  and  F  proteins  are  involved  in  Fe(CN)2CO  biosynthesis  and  are  absolutely 
required  for  maturation  of  [NiFe]-hydrogenases  [169-171].  The  cyanide  ligands  of  the  active  site 
cluster  are  derived  from  carbamoylphosphate  in  a  series  of  reactions  involving  HypE  and  HypF 
(Figure  7)  [172].  HypF  utilizes  carbamoylphosphate  as  substrate  and  first  converts  it  to 
carbamate  and  then  to  carbamoyladenylate  (in  an  ATP-dependent  step)  before  finally 
transferring  the  carbamoyl  functional  group  to  the  C-terminal  cysteine  residue  of  HypE  (Figure 
7, 1  -  V)  [172,  173].  The  intennediates  that  are  produced  by  HypF  are  quite  unstable,  however,  a 
mechanism  for  their  fonnation  is  suggested  by  the  HypE-HypF  complex,  a  heterotetrameric 
structure  comprised  of  a  HypE  dimer  flanked  by  two  HypF  molecules  [174].  The  structure 
supports  a  mechanism  in  which  the  N-terminal,  HypF  acylphosphatase  domain  hydrolyzes 
carbamoylphosphate  to  carbamate  and  inorganic  phosphate;  carbamate  is  transferred  through  an 
internal  channel  to  the  YrdC  domain  where  ATP  is  bound  and  carbamoyladenylate  formation 
occurs  with  release  of  pyrophosphate  [174,  175],  Carbamoyl  group  transfer  to  the  C-terminal 
cysteine  of  HypE,  with  the  liberation  of  AMP,  occurs  within  the  C-terminal  Kael-like  domain 
[174].  The  Kael-like  domain  of  HypF  binds  a  mononuclear  Fe  ion  via  two  His  and  two  Asp,  in 
similar  fashion  to  other  ASKHA  superfamily  members  like  Kael  and  TobZ  [174,  176-178].  In 
HypF,  this  Fe  ion  coordinates  the  carbamoyladenylate  moiety,  promoting  the  transferase  reaction 
[174]. 
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The  thiocarboxamide  group  bound  to  HypE’s  tenninal  Cys  is  converted  to  thiocyanate  in 
a  manner  reminiscent  of  reactions  catalyzed  by  fonnylglycinamide  ribonucleotide 
amidotransferase  (PurL)  and  amino  imidazole  ribonucleotide  synthetase  (PurM);  HypE,  PurL, 
and  PurM  all  utilize  ATP  to  form  phosphoryl  anhydride  intermediates  [179-181].  In  the  case  of 
HypE,  the  thiocarboxamide  oxygen  group  is  within  van  der  Waals  distance  of  the  y  -  phosphate 
group  of  ATP  and  is  optimally  positioned  for  in-line  attack  [182].  The  thiocarboxamide  group  is 
proposed  to  become  activated  via  deprotonation  by  an  active  site  H2O  molecule.  This  H2O 
hydrogen  bonds  to  Lysl34  which  in  turn  is  in  close  proximity  to  an  Arg  residue  that  decreases  its 
effective  pKa  to  approximately  5.1,  allowing  Lysl34  to  act  as  a  general  base  [182],  Following 
the  in-line  attack  of  ATP,  ADP  is  released  and  a  thiocarbamic  phosphoryl  anhydride  species  is 
formed  [181,  182].  This  iminophosphate  intennediate  is  then  activated  for  dephosphorylation 
via  deprotonation  of  the  imino  nitrogen  by  a  conserved  Glu272  residue  [182].  The  resulting 
thiocyanate  bound  to  Cys  at  the  C-terminus  of  HypE  is  on  a  flexible  loop  that  not  only  permits 
the  insertion  of  the  critical  cysteine  residue  near  the  carbamoylation  active  site  in  the  Kael 
domain  of  HypF,  but  then  also  facilitates  delivery  of  the  thiocyanate  to  the  HypC-HypD  complex 
where  delivery  of  CN  to  iron  occurs  (Figure  8)  [172,  181,  183]. 

Fe(CN)2CO  Cofactor  Biosynthesis:  HypC  and  HypD 

Proteins  in  the  HypC  family  of  proteins  are  approximately  10  kDa  in  size  and  associate 
with  HypD  and  a  precursor  form  of  the  hydrogenase  large  subunit,  and  also  form  a  ternary 
complex  with  HypD  and  HypE  [22,  184-189].  HypC  is  structurally  simple,  being  comprised  of  a 
C-tenninal  a-helix  joined  to  a  (3-barrel  in  what  is  known  as  an  OB-fold  [188,  190,  191].  HypD 
proteins  are  approximately  40  kDa  in  size  and  contain  a  unique  C-tenninal  CX14CX6CX16C 
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motif  in  domain  III  that  coordinates  a  [4Fe-4S]  cluster,  whereas  domains  I  and  II  are  both 
characterized  as  Rossmann  folds  [186,  191,  192],  The  cluster  environment  in  domain  III  shares 
similarities  with  the  ferredoximthioredoxin  reductase  system  wherein  a  redox  cascade  is  created 
that  involves  the  [4Fe-4S]  cluster  and  a  pair  of  cysteine  residues  that  are  in  close  proximity  to  it, 
although  these  Cys  residues  are  not  absolutely  conserved  in  all  HypD  isofonns  [189,  191,  193]. 
The  redox  cascade  is  extended  to  four  other  conserved  motifs  in  HypD  (CGXHXH, 
GPGCPVCX2P,  GFETT,  and  PXHVSX3G)  and  this  conduit  has  been  proposed  to  play  a  role  in 
the  mechanism  of  iron  cyanation  [187,  188,  191], 

Co-expression  studies  revealed  that  substoichiometric  amounts  of  HypE  were  associated 
with  a  HypC-HypD  complex  and  that  this  complex  accepted  CN  from  HypE;  CN  was  not 
transferred  to  either  HypC  or  HypD  when  they  were  expressed  singly,  suggesting  that  the 
HypC-HypD  complex  shared  coordination  of  the  CN  group  (Figure  8)  [186].  The  exact  details 
of  diatomic  ligand  coordination  to  Fe  at  this  stage  of  maturation  are  not  fully  resolved  and  a 
consensus  model  has  not  yet  been  attained.  Structural  characterization  of  the  HypC-D-E  ternary 
complex  provides  a  picture  for  how  the  conserved  motifs  of  HypD  create  the  scaffold  for 
Fe-cyanation;  a  model  is  proposed  wherein  Cys38  from  HypD  and  Cys2  of  HypC 
( Thermococcus  kodakarensis  numbering)  bind  an  Fe  ion  along  with  two  CN  ligands  and  two 
unresolved  moieties  [188].  Spectroscopic  characterization  of  anaerobic  preparations  of 
HypC-HypD  complexes  containing  substoichiometric  amounts  of  HypE  from  E.  coli  [187]  and 
HypC-HypD  complexes  from  both  Ralstonia  eutropha  and  E.  coli  [189]  show  FTIR  bands  that 
correspond  to  an  Fe(CN)2CO  species  similar  to  that  observed  in  [NiFe]-hydrogenase. 
Importantly,  the  characterization  of  the  HypC-HypD  complex  reported  by  Soboh  et  al.  revealed 
the  existence  of  two  labile  Fe  ions  in  addition  to  the  [4Fe-4S]  cluster,  and  also  showed  an  IR 
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feature  at  2337  cm'1  assigned  to  vC02;  Cys41  in  HypD  was  shown  to  be  required  for  the 
coordination  of  CN  ,  CO,  and  CO2,  presumably  to  one  of  the  labile  Fe  ions  [187].  Analysis  of 
anaerobically  purified  HypD  showed  that  it  contained  FTIR  bands  associated  with  a  CO  and  two 
CN  groups,  consistent  with  the  presence  of  an  Fe(CN)2CO  moiety;  no  2337  cm'1  C02  feature 
was  present  (Figure  8)  [194]. 

Insight  into  the  significance  of  C02  was  provided  in  a  subsequent  report  that  examined 
individually  purified  HypC  along  with  its  E.  coli  homolog  HybG  [195].  Characterization 
revealed  that  under  anaerobic  conditions  the  proteins  contained  ~0.3  mol  Fe  per  mol  protein  and 
exhibited  single  FTIR  bands  arising  from  C02.  Evaluation  of  variant  HypC  proteins  conclusively 
demonstrated  that  both  Cys2  and  His51  were  absolutely  required  for  Fe  and  C02  coordination 
[195],  This  observation  coupled  with  the  presence  of  the  2337  cm"1  feature  in  the  HypC-HypD 
complex  suggests  that  HypC  delivers  Fe-C02  to  HypD  where  the  C02  is  reduced  to  CO  (Figure 
8)  [187,  195].  Importantly,  these  results  and  others  suggest  that  metabolic  C02  is  the  source  of 
the  CO  ligand,  although  this  has  yet  to  be  experimentally  demonstrated  [195,  196], 

In  the  case  of  HypD,  Cys41,  Cys69,  and  Cys72  are  all  critical  for  synthesis  of  the 
intermediate  Fe(CN)2CO  cluster  [194].  Collectively,  these  data  suggest  a  model  wherein  Cys2 
and  His51  of  HypC  and  Cys41  and  His44  of  HypD  (E.  coli  numbering)  come  together  to 
coordinate  a  precursor  Fe  ion  in  a  tetrahedral  environment;  addition  of  the  CO  and  the  CN 
ligands  results  in  HypC  dissociation  and  addition  of  His201  to  accomplish  formation  of  the 
octahedrally  coordinated  Fe(CN)2CO  unit  on  HypD  (Figure  8)  [194,  195].  The  assignment  of 
His44  and  His201  as  the  fourth  and  fifth  ligands  is  not  established  and  it  is  also  possible  that 
Cys69  and  Cys72  perform  this  function  [194]. 
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Insertion  ofNi2+:  Hyp  A  and  HypB 

Two  of  the  hyp  gene  products,  HypA  and  HypB,  are  involved  in  acquiring  and  inserting 
Ni2  and  studies  have  shown  that  these  proteins  are  not  absolutely  required  because  addition  of 
Ni2'  during  anaerobic  growth  restores  hydrogenase  activity  in  mutant  cell  lines  [170,  171,  197, 
198].  The  insertion  of  Ni2H  into  the  large  subunit  occurs  only  after  the  Fe(CN)2CO  moiety  is 
present  (Figure  8)  [22,  199],  HypA  is  a  140  amino  acid  protein  that  exists  in  both  monomeric 
and  dimeric  states  [200].  HypA  coordinates  a  single  Ni21  ion  with  micromolar  affinity  via  an  N- 
tenninal  MHE  motif  [200-202],  and  also  binds  Zn21  through  the  cysteine  thiolates  of  a  zinc 
finger  motif  [200],  While  the  Ni2 1  and  Zn2 1  binding  domains  are  independent  of  one  another,  the 
presence  of  Ni21  appears  to  help  dictate  the  orientation  of  these  two  domains,  likely  as  a 
mechanism  to  mediate  protein-protein  interactions  [193,  200,  202].  Moreover,  HypA  exhibits 
low  sequence  conservation  outside  of  the  Ni21  and  Zn21  motifs;  this  mirrors  the  sequence 
diversity  in  both  HypB  and  [NiFe]-hydrogenase,  all  three  of  which  putatively  interact  during 
Ni2  delivery  [193]. 

HypB  is  a  Ni2 1  -metallochaperone  that  has  GTPase  activity,  which  is  essential  to  achieve 
complete  hydrogenase  maturation  [22,  203,  204].  Several  HypB  homologs  exist  with  varying 
sequence-based  metal  binding  properties.  For  example,  E.  coli  HypB  contains  a  high  affinity, 
Ni2  binding  N-tenninal  CXXCGC  motif  (which  is  not  absolutely  conserved)  and  a  C-tenninal 
GTPase  domain  that  can  bind  either  Ni2  or  Zn21  [205-207].  HypB  dimerization  is 
Ni2+-dependent  and  its  GTPase  activity  is  modulated  by  metal  binding  [208],  The  structure  of 
the  nucleotide  bound  Methanocaldococcus  jannaschii  enzyme  shows  two  Zn2 1  ions  bound  at  the 
dimer  interface,  utilizing  a  combination  of  cysteine  thiolate,  histidine  imidazole,  and  H2O  as 
ligands  [206],  Structural  analysis  of  the  nucleotide  bound  form  of  Helicobacter  pylori  HypB,  on 
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the  other  hand,  reveals  a  single  Ni21  ion  bound  at  the  dimer  interface  via  the  tetrathiolate 
coordination  of  Cysl06  and  Cysl42  from  each  monomer,  whereas  in  the  absence  of  nucleotide 
His  107  becomes  a  metal  ligand  [209].  Cysl42  is  part  of  the  GTPase  Switch  II  motif  and  is  likely 
a  ligand  to  both  Ni21  and  Zn2  bound  fonns  of  HypB  regardless  of  the  nucleotide  bound  state 
[209].  This  enables  Cysl42  to  couple  GTP  hydrolysis  with  metal  binding  and  delivery, 
potentially  providing  a  mechanism  wherein  HypB  distinguishes  between  Ni2  -  and  Zn2 1  -loaded 
forms,  precluding  delivery  of  zinc  to  the  large  subunit  during  maturation  [206,  209,  210]. 

While  the  molecular  mechanism  for  Ni2  transfer  to  the  large  catalytic  subunit  is  not  yet 
fully  resolved,  delivery  of  nickel  in  vivo  is  likely  modulated  by  protein-protein  interactions  and 
evidence  for  HypA,  HypB,  SlyD  (a  Ni21  metallochaperone  in  E.  coli),  and  HycE  (the  large 
subunit  of  hydrogenase  3  in  E.  coli )  complexes  have  been  reported  (Figure  8)  [211-216]. 
HypB’s  interaction  with  the  large  subunit  requires  the  presence  of  HypA,  providing  support  for 
HypA  being  the  docking  protein  between  HypB  and  HycE  in  the  nickel  delivery  step  [202,  211, 
212].  Moreover,  HypA  selectively  removes  Ni2+  from  the  GTPase  domain  of  HypB  and  this 
metal  release  appears  to  be  stimulated  by  GTP  hydrolysis  [217].  Despite  the  observation  that 
Ni2  binding  to  HypB  partially  inhibits  the  GTPase  activity  of  HypB  [209],  it  is  presumed  that 
complex  formation  in  vivo  alleviates  this  retardation  and  promotes  nickel  mobilization  [217]. 
Along  these  lines,  it  has  been  demonstrated  that  in  E.  coli,  SlyD  delivers  Ni2+  to  HypB  and 
heterodimer  fonnation  between  these  proteins  promotes  GTP  hydrolysis  by  HypB  [218].  This 
provides  a  mechanism  whereby  SlyD  acts  to  mediate  the  delivery  of  Ni2  "  to  the  large  subunit  via 
the  GTPase  activity  of  HypB;  the  favorable  interaction  between  SlyD  and  the  GDP-bound  form 
of  HypB  may  help  drive  the  Ni21  insertion  process  and  overcome  any  thermodynamic  barriers 
associated  with  metal  ion  delivery  [207,  213,  218], 
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Ni2 '  Dependent  Proteolysis  and  Active  Site  Closure 

In  a  final  step  to  accomplish  maturation  of  the  large  catalytic  subunit,  a  peptide  on  the  C- 
tenninus  is  processed  (Figure  8).  Peptide  length  varies  somewhat  among  different  hydrogenases; 
a  peptide  of  approximately  15  residues  is  most  commonly  cleaved  although  maturation  of  E.  coli 
hydrogenase  3  results  in  the  removal  of  32  residues  [183].  The  peptide  extension  of  the  large 
subunit  is  necessary  for  interaction  with  HypC,  and  its  presence  helps  keep  the  large  subunit  in 
an  open  conformation  for  both  Ni21  and  Fe(CN)2CO  insertion  [184,  185,  219].  The 
endopeptidases  in  E.  coli  that  mature  the  three  cognate  [NiFe] -hydrogenases  are  HyaD 
(hydrogenase  1),  HybD  (hydrogenase  2),  and  Hycl  (hydrogenase  3)  [220],  These  enzymes 
recognize  the  DPCXXCXXH/R  consensus  motif  that  helps  coordinate  the  active  site  metal  center 
in  [NiFe] -hydrogenase;  the  proteases  cleave  between  the  basic  His/Arg  and  the  nonpolar 
Met/Ile/Val/Ala  residues  [221-224].  Nickel  promotes  the  recognition  of  the  binding  motif  by  the 
proteases  and  explains  why  cleavage  only  occurs  after  Ni21  has  been  inserted  [225].  Structural 
characterization  of  Hycl  suggests  that  two  conserved  Asp  residues  may  be  critical  for  Ni21 
coordination,  implying  that  Hycl  functions  via  metal-based  activation  wherein  the  Ni2  ion 
polarizes  the  carbonyl  oxygen  of  the  peptide  bond,  which  in  turn  fosters  hydrolysis  by  H20  [183, 
226].  Upon  cleavage  of  the  C-terminal  peptide,  a  conformational  change  in  the  active  site 
environment  is  induced  which  effectively  internalizes  the  NiFe(CN)2CO  moiety  and  affords  the 
active  enzyme  [22], 
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[FeFe]-Hydrogenase  Active  Site  Assembly 


Identification  of  the  Maturation  Machinery 

Three  genes  denoted  hydE,  hydF,  and  hydG  are  found  in  all  organisms  expressing  [FeFe]- 
hydrogenase  (HydA)  and  the  expression  of  active  HydA  in  E.  coli  requires  coexpression  with 
HydE,  HydF,  and  HydG  [26,  227].  Activation  of  HydA  expressed  singly  (HydAAEFG)  is 
accomplished  via  addition  of  E.  coli  lysate  mixtures  containing  HydE,  HydF,  and  HydG 
together;  H2(g)  generation  under  these  experimental  conditions  does  not  require  exogenous  small 
molecules,  leading  to  the  conclusion  that  the  maturase  enzymes  utilize  ubiquitous  small 
molecules  present  in  the  cellular  matrix  to  assemble  the  H-cluster  [24,  227].  Sequence 
annotation  coupled  with  preliminary  biochemical  characterization  demonstrated  that  HydE  and 
HydG  both  belong  to  the  radical  S-adenosyl-L -methionine  (SAM)  enzyme  superfamily,  while 
HydF  is  an  FeS  cluster  binding  GTPase  [26,  227-229].  Point  mutations  in  the  radical  SAM 
motifs  of  HydE  and  HydG,  as  well  as  the  FeS  cluster  and  GTPase  regions  of  HydF,  all  proved  to 
be  deleterious  to  achieving  HydA110  activation,  meaning  that  HydE,  HydF,  and  HydG 
functionalities  are  indispensable  for  proper  H-cluster  synthesis  [227]. 

Activation  of  HydAAEFG  Requires  a  Preformed  [4Fe-4S]  Cluster 

Spectroscopic  and  structural  characterization  of  the  monomeric  [FeFe]-hydrogenase  of 
the  green  algae  Chlamydomonas  reinhardtii  (Cr)  has  provided  important  insights  into  the 
maturation  process.  The  enzyme  HydAAEFG,  obtained  from  a  strain  that  lacks  the  genes  encoding 
the  three  maturation  proteins  HydEFG,  contains  the  [4Fe-4S]  cubane  of  the  H-cluster  [230,  231]. 
Importantly,  Cr  HydAAEFG  is  readily  activated  by  E.  coli  lysate  containing  HydE,  HydF,  and 


34 


HydG.  Moreover,  the  metal-free  form  of  the  protein  must  first  be  chemically  reconstituted  with 
iron  and  sulfide  prior  to  successful  activation  by  lysate  [230].  These  results  show  that  activation 
by  the  maturase  enzymes  requires  a  preformed  [4Fe-4S]  cluster  that  is  presumably  synthesized 
by  the  endogenous  iron  sulfur  cluster  assembly  machinery  of  the  cell,  and  also  suggest  that 
HydE,  HydF,  and  HydG  come  together  to  synthesize  the  2Fe  subcluster  component  of  the  H- 
cluster  [230,  231].  In  vitro  labeling  studies  provide  additional  support  for  this  model,  as 
activation  of  HydAAEFG  by  57Fe-labeled  HydE,  HydF,  and  HydG  lysates  demonstrate  the 
incorporation  of  57Fe  into  only  the  2Fe  subcluster  of  the  H-cluster  [232]. 

Comparison  of  the  Cr  HydA  I  I  (l  structure  to  that  of  C'p\  HydA  provides  insight  into  2Fe 
subcluster  insertion  [231],  An  electropositive  channel  filled  with  H2O  molecules  runs  from  the 
protein  surface  to  the  active  site  cavity  of  Cr  HydAAEFG;  this  channel  is  absent  in  the  mature  Cp  I 
HydA  structure,  which  contains  two  ordered  loop  regions  that  effectively  shield  the  H-cluster 
from  solvent.  The  loop  regions  in  Cr  HydAAEFG  are  splayed  open,  providing  clear  access  to  the 
active  site  cavity  for  2Fe  subcluster  insertion,  which  may  either  be  electrostatically-  or 
entropically-driven  [231,  233]. 

HydF  is  an  Iron  Sulfur  Cluster  Binding  GTPase 

HydF  is  a  47  kDa  monomeric  protein  that  binds  FeS  clusters  and  hydrolyzes  GTP  to  GDP 
[229,  234].  The  purified  state  of  this  protein  is  a  composite  mixture  of  dimers  and  tetramers  and 
structural  characterization  of  the  metal-free  enzyme  has  suggested  that  FeS  cluster(s)  bind  at 
subunit  interfaces  [235],  Both  FeS-  and  GTP-binding  motifs  are  essential  for  maturation  in  the 
Ca  system  [227].  Moreover,  among  the  three  maturase  enzymes,  only  purified  HydF  when 
expressed  in  a  genetic  background  of  HydE  and  HydG  (HydFEG)  could  activate  HydAAEFG.  This 
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activation  was  achieved  in  the  absence  of  exogenously  added  proteins  or  small  molecules, 
suggesting  that  HydF  bound  a  precursor  form  of  the  H-cluster  needed  to  achieve  maturation 
[236], 

Spectroscopic  comparison  of  HydFEG  to  HydFAEG  revealed  that  both  proteins  bound  [4Fe- 
4S]+  clusters,  but  HydFiEG  samples  contained  additional  LMCT  features  in  UV-Vis  spectra  and 
an  overlapping  axial  component  in  EPR  spectra  that  are  thought  to  arise  from  a  [2Fe-2S]+  cluster 
[234,  236].  Preparations  of  Ca  HydFEG  revealed  the  existence  of  Fe-CN,  Fe-CO,  and 
Fe-CO-Fe  species  via  FTIR  spectroscopy  [234,  237].  These  moieties  were  absent  in  HydFAEG, 
suggesting  that  HydE  and  HydG  modified  a  [2Fe-2S]  cluster  precursor  bound  to  HydFAEG  to 
produce  the  2Fe  subcluster  of  the  H-cluster  [27,  234], 

The  FeS  cluster  states  of  both  wild  type  and  variant  forms  of  HydF  have  been  examined 
from  various  species  and  all  of  the  results  provide  evidence  for  [4Fe-4S]  cluster  binding  to  HydF 
[229,  234,  237-242],  The  coordination  environment  for  the  [4Fe-4S]  cluster  utilizes  three 
conserved  cysteine  residues;  the  fourth  coordination  site  appears  to  be  variable,  either  being  a 
histidine  in  the  case  of  Ca  HydF  or  a  labile/exchangeable  ligand  in  Thermotoga  maritima  ( Tm ) 
and  Thermotoga  neopolitana  (Tn)  HydF  [235,  239,  241-243].  The  observation  and  significance 
of  [2Fe-2S]+  cluster  binding  by  HydF  is  still  an  open  issue.  While  similar  overlapping  axial 
signals  to  those  initially  reported  [234]  have  now  been  observed  in  reduced  preparations  of  both 
Ca  and  Tn  samples,  it  has  been  suggested  this  signal  arises  from  a  radical  in  close  proximity  to 
the  [4Fe-4S]+  cluster  [241,  242,  244].  Regardless,  multiple  lines  of  evidence  support  the  notion 
that  the  2Fe  subcluster  bound  by  HydF  resembles  the  H-cluster,  and  may  potentially  even  be 
directly  linked  to  the  [4Fe-4S]  cubane  present  on  HydF  [238,  244]. 
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While  the  exact  role  of  GTP  hydrolysis  in  [FeFe]-hydrogenase  maturation  remains 
unresolved,  it  is  clear  that  the  GTPase  activity  of  HydF  is  not  associated  with  the  activation  step 
of  HydAAEFG  by  HydFEG  [234].  In  a  manner  reminiscent  of  HypB  [209],  the  GTPase  activity  of 
HydF  is  gated  by  different  monovalent  cations  and  these  regulate  GTP  hydrolysis  by  ~  40-fold. 
HydF  therefore  belongs  to  a  subclass  of  GTPase  enzymes  wherein  an  alkali  metal  substitutes  for 
the  “arginine  finger”  of  the  partner  GTPase  activating  protein  [234].  The  ability  of  HydFEG  to 
mature  HydAAEFG  is  unaltered  by  HydF  being  in  either  a  “GTPase-on”  or  a  “GTPase-off  ’  state, 
as  established  by  accomplishing  the  activation  in  various  alkali  containing  buffers  [234]. 
However,  it  was  observed  that  the  rate  of  GTP  hydrolysis  was  stimulated  when  HydF  was 
assayed  in  the  presence  of  either  HydE  or  HydG,  leading  to  the  hypothesis  that  the  production  of 
GDP  could  be  coupled  to  gating  the  protein-protein  interactions  associated  with  2Fe  subcluster 
assembly  [234],  Experiments  designed  to  probe  protein-protein  interactions  between  the 
maturases  revealed  that  HydE  and  HydG  do  not  bind  to  HydF  concurrently,  suggesting  that  the 
radical  SAM  enzymes  associate  with  the  same  binding  site  on  HydF  [245].  Moreover,  HydE  and 
HydF  bind  to  one  another  with  an  order  of  magnitude  higher  affinity  than  HydG  binds  to  HydF 
and  addition  of  GTP  during  dissociative  phases  causes  enhanced  detachment  rates  for  both 
HydE-HydF  and  HydG-HydF  complexes.  [245].  Collectively,  these  results  implicate  GTP 
binding  and  hydrolysis  as  providing  a  mechanism  for  gating  the  interactions  between  HydE  and 
HydG  with  the  scaffold/carrier  HydF,  and  this  may  be  linked  to  nucleotide  dependent  structural 
changes  in  HydF. 

While  the  majority  of  reports  provide  support  for  the  indispensable  nature  of  HydF 
during  maturation  [24,  26,  227,  234,  236,  237],  results  with  Shewanella  oneidensis  maturases 
suggest  that  HydF  is  nonessential.  Specifically,  the  in  vitro  HydAAEFG  activation  studies  with  this 
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system  concluded  that  HydG  is  the  only  maturase  enzyme  absolutely  required  [240]. 
Interestingly,  HydAAEFG  activation  experiments  using  compounds  designed  to  mimic  the  2Fe 
subcluster  show  that  these  analogs  can  not  only  be  loaded  into  a  [4Fe-4S]  cluster  containing  form 
of  Tm  HydF,  but  that  this  charged  HydF  can  then  fully  mature  HydAAEFG  [246].  A  subsequent 
study  demonstrated  that  the  2Fe  biomimetic  analogs  could  load  into  HydAAEFG  in  the  absence  of 
HydF,  showing  that  HydF  is  not  required  for  in  vitro  activation  [247]. 

HydG  and  Diatomic  Ligand  Biosyn  thesis  from  Tyrosine 

HydG  contains  a  CX3CX2C  N-tenninal  motif  identifying  it  as  a  member  of  the  radical 
SAM  superfamily  of  enzymes  [26,  248],  This  55  kDa  monomeric  protein  exhibits  high  similarity 
to  ThiH,  an  enzyme  involved  in  thiamine  pyrophosphate  synthesis  that  cleaves  L-tyrosine  into  p- 
cresol  and  dehydroglycine  (DHG)  [248,  249].  Like  ThiH,  HydG  also  utilizes  L-tyrosine  as 
substrate  and  forms  />crcsol  (Figure  9)  [250].  HydG  is  differentiated  from  ThiH,  however,  in  the 
existence  of  a  90  amino  acid  C-tenninal  extension  containing  a  CX2CX22C  motif  that  harbors  an 
accessory  [4Fe-4S]  cluster.  The  presence  of  the  C-tenninal  [4Fe-4S]  cluster  is  essential  for 
[FeFe]-hydrogenase  maturation,  and  the  fonnation  of  the  diatomic  products  CO  and  CN  [23, 
227,  251,  252].  Diatomic  ligand  production  during  HydG  catalysis  was  demonstrated  both  by 
derivatizing  CN  into  a  l-cyanobenz[f]isoindole  adduct  [23]  and  by  trapping  CO  via  binding  to 
deoxyhemoglobin  [251].  Both  of  these  diatomic  species  formed  on  the  same  time  scale  and  it 
was  hypothesized  that  they  could  be  formed  in  a  single  step  via  a  decarbonylation  reaction 
involving  DHG  [23]. 

Analysis  of  HydG  variant  proteins  demonstrated  the  absolute  requirement  of  the  C- 
tenninal  [4Fe-4S]  cluster  for  formation  of  CO,  although  CN  was  still  detected  in  variants 
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containing  either  single  or  double  point  mutations  to  residues  in  the  CX2CX22C  motif  [253,  254], 
Assessment  of  a  variant  in  which  the  three  cysteines  of  the  CX3CX2C  N-tenninal  motif  were 
altered  to  alanine  revealed  that  SAM  coordination  to  the  enzyme  occurs  exclusively  at  the  N- 
tenninal  cluster,  despite  the  apparent  site-differentiated  nature  of  the  C-terminal  [4Fe-4S]  cluster 
[254].  Evaluation  of  ACTD  HydG,  a  variant  that  is  missing  the  C-tenninal  domain,  shows  that 
the  enzyme  still  acts  as  a  tyrosine  lyase  [253-255].  Collectively,  the  results  show  that  SAM 
binding  and  cleavage,  with  subsequent  H-atom  abstraction  from  tyrosine,  and  the  generation  of 
/?-cresol  and  DHG,  all  occur  within  the  core  TIM  barrel  fold  [244,  254]. 

Spectroscopic  characterization  of  intermediate  species  formed  during  catalysis  provided 
direct  evidence  for  heterolytic  Ca-C|3  tyrosine  bond  cleavage,  with  generation  of  a  /;-crcsolate 
radical  and  DHG  (Figure  9)  [256].  Stopped-flow  FTIR  studies  were  used  to  monitor  the  fate  of 
DHG  to  formation  of  either  Fe-CO-CN  or  Fe-(CO)2CN  moieties.  It  was  assumed  that  these 
species  fonned  at  the  site  differentiated  Fe  of  the  accessory  [4Fe-4S]  cluster  although  there  is  no 
direct  experimental  evidence  for  this  [252],  In  separate  experiments,  57Fe  was  used  to  track  iron 
transfer  from  HydG  to  HydA,  suggesting  that  the  C-tenninal  [4Fe-4S]  cluster  in  HydG  becomes 
cannibalized  to  form  the  Fe-CO-CN  building  blocks  of  the  2Fe  subcluster  [252].  Furthermore, 
it  is  known  that  all  five  diatomic  ligands  of  the  H-cluster  are  derived  from  tyrosine  [257], 
suggesting  that  multiple  rounds  of  HydG  catalysis  are  necessary  for  2Fe  subcluster  formation. 
The  mechanistic  details  of  this  process  are  unknown  but  must  be  resolved  to  delineate  the  role  of 
HydF  as  a  scaffold/carrier  protein  during  biosynthesis  [244]. 

An  Undefined  Role  in  H-cluster  Biosynthesis:  HydE 
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In  contrast  to  the  extensive  experimental  data  supporting  the  proposed  roles  of  HydF  and 
HydG  in  H-cluster  biosynthesis,  the  role  of  HydE,  a  member  of  the  radical  SAM  superfamily  of 
enzymes,  is  still  unknown.  It  has  been  suggested  that  HydE  functions  in  an  observatory  role 
during  maturation,  possibly  acting  as  a  chaperone  and  assisting  HydF  during  translocation  of  the 
2Fe  subcluster  species  [240].  It  seems  much  more  likely,  however,  that  HydE  utilizes  a  common 
metabolite  and  synthesizes  the  bridging  dithiomethylamine  ligand  (Figure  9).  Evidence  in 
support  of  this  comes  from  the  absolute  requirement  of  HydE  to  achieve  [FeFe]-hydrogenase 
activity  in  E.  coli  lysate  experiments  [24,  26,  227,  230]  and  the  difficult  chemistries  often 
associated  with  radical  SAM  enzymes  [248].  HydE  is  a  42  kDa  monomer  and  contains  a 
CX3CX2C  N-terminal  motif,  like  other  radical  SAM  enzymes  [26,  248],  However,  it  shows  high 
sequence  similarity  to  the  methylornithine  synthase  PylB  [228,  244,  258],  rather  than  to  enzymes 
like  BioB  that  catalyze  a  sulfur  insertion  reaction  [259].  While  the  significance  of  HydE’s 
relationship  to  PylB  is  difficult  to  currently  gauge,  it  is  possible  that  it  is  an  indication  that  these 
enzymes  exhibit  mechanistic  parallels. 

Multiple  X-ray  crystal  structures  of  HydE  exist  and  these  include  SAM  and  5’- 
deoxyadenosine/methionine  bound  states  [259-261],  The  global  architecture  of  HydE  shows  the 
existence  of  an  internal  electropositive  cavity  spanning  the  breadth  of  the  (|3a)s  TIM  barrel  fold. 
Three  anion  binding  sites  were  observed  within  the  internal  cavity  and  SCN  also  binds  here, 
potentially  representing  a  pathway  whereby  a  small  molecule  product(s)  traverses  from  the  top  to 
the  bottom  of  the  barrel  where  delivery  to  a  maturation  partner  protein  occurs  [259].  Surface 
plasmon  resonance  experiments  indicate  that  HydE  and  HydF  bind  to  one  another  with  high 
affinity  and  even  exist  as  a  fused  gene  product  in  some  organisms  [26,  245].  Regardless  of  these 
experimental  findings,  the  substrate  and  mechanism  of  HydE  in  H-cluster  biosynthesis  remain 
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unresolved.  Although  it  is  somewhat  tenuous  to  assign  it  a  role  in  dithiomethylamine  synthesis, 
this  seems  the  most  likely  given  the  precedence  in  the  literature  for  the  chemistries  associated 
with  HydF  and  HydG  [244].  An  intriguing  aspect  of  HydE’s  presumed  chemistry  relates  to  the 
source  of  the  sulfurs  in  the  2Fe  subcluster.  Future  work  will  hopefully  resolve  if  the  sulfurs  are 
derived  from  HydE’s  substrate  or  if  they  are  cannibalized  from  HydG’s  C-terminal  cluster. 

Mechanistic  Parallels  and  Differences  Between  [NiFe]  -  and  [FeFe]  -Hydrogenase  Biosynthesis 
[NiFe]-  and  [FeFe]-hydrogenases  are  a  superb  example  of  convergent  evolution  and  it 
follows  that  their  biosynthetic  pathways  exhibit  both  commonalities  and  differences.  Generally, 
it  is  straightforward  to  see  that  both  active  site  assembly  pathways  require  multiple  maturation 
proteins  involved  in  intricate  chemical  transfonnations.  Moreover,  each  system  utilizes  a  key 
scaffold  protein  that  belongs  to  the  same  subclass  of  GTPase  enzymes,  which  are  activated  by 
alkali  metals.  The  fonnation  and  transfer  of  Fe-diatomic  species  among  proteins  is  also  an 
intriguing  aspect  shared  by  these  systems,  despite  the  fact  that  the  source  of  these  diatomic 
ligands  is  disparate.  [FeFe] -hydrogenase  assembly  requires  the  involvement  of  two  radical  SAM 
enzymes,  a  feature  that  noticeably  distinguishes  it  from  [NiFe]-hydrogenase  maturation. 

Future  Frontiers  of  Hydrogenase  Research 

Hydrogenases  are  unique  enzymes  and  the  recent  insights  into  their  diversity,  mechanism, 
and  maturation  have  revealed  many  surprises  that  represent  highly  unique  aspects  of 
bioinorganic  chemistry,  as  summarized  here.  Although  a  number  of  questions  have  been 
answered  through  the  surge  of  recent  work  in  this  area,  these  exciting  new  discoveries  also  serve 
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to  open  the  door  for  perhaps  more  lines  of  inquiry  than  they  have  closed.  It  is  likely  that 
hydrogenases  will  have  a  central  role  in  clarifying  the  newly  discovered  mechanism  of  energy 
conservation  tenned  electron  bifurcation  ([35]).  In  many  systems  these  bifurcating  reactions  are 
likely  key  points  for  controlling  and  balancing  electron  flow  in  metabolism  and  a  better 
fundamental  understanding  of  these  processes  could  potentially  provide  the  basis  for  proactively 
controlling  electron  flow  for  the  production  of  biofuels  in  engineered  systems. 

The  striking  functional  diversity  of  [FeFe] -hydrogenases  and  [NiFe]-hydrogenases  has 
interesting  evolutionary  implications  and  the  further  characterization  of  representatives  of  the 
various  diverse  classes  of  these  enzymes  will  lead  to  more  fascinating  evolutionary  insights. 
Most  would  agree  that  aspects  of  hydrogen  metabolism  must  have  been  a  component  of  the 
metabolism  of  early  life  and  this  appears  to  be  supported  for  the  [NiFe]-hydrogenases  but 
arguably  not  for  the  [FeFe]-hydrogenases.  The  traditional  dogma  and  natural  impetus  is  that 
hydrogenase  must  be  ancient  or  even  must  be  primordial  because  of  the  perceived  importance  of 
hydrogen  metabolism  for  early  life.  However,  in  the  case  of  [FeFe]-hydrogense,the  available 
data  including  their  taxonomic  occurrence  among  extant  organisms  is  inconsistent  with  an 
ancient  origin.  This  begs  the  question  of  whether  [NiFe]-hydrogenases  and  [FeFe] -hydrogenases 
are  both  ancient,  or  primordial,  and  what  the  selective  pressures  might  have  been  to 
independently  evolve  so-called  functionally  redundant  enzymes.  Further  analysis  of 
hydrogenase  diversity  in  the  context  of  their  modes  of  metabolism  could  provide  important  new 
clues  into  hydrogenase  ancestry  as  well  as  the  ancestry  of  chemiosmosis  and  respiration.  One 
driving  force  will  likely  be  the  close  evolutionary  relationship  between  one  specific  type  of 
[NiFe]-hydrogenase  (the  group  4  energy-conserving  enzymes)  and  the  ubiquitous  complex  I  of 
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the  aerobic  respiratory  chain.  How  did  the  latter  evolve  from  a  presumably  ^-metabolizing 
ancestor? 

Additional  insight  gained  into  the  relationship  between  sequence  variations  and  the 
respective  physiological  roles  for  hydrogenases  have  potential  to  further  identify  the  key 
structural  determinants  for  catalysis.  Hydrogenases  are  intriguing  models  for  proton  coupled 
electron  transfer  reactions  and  these  are  in  some  regards  very  special  enzymatic  reactions.  We 
are  taught  in  general  chemistry  of  the  separation  between  thermodynamics  and  kinetics  in 
chemical  reactions  and  that  catalysts  by  definition  exert  the  influence  solely  on  the  rates  of 
reactions  and  don’t  influence  reaction  thermodynamics.  However  for  enzymes  involved  in 
proton  coupled  electron  transfer  reactions  there  is  clear  evidence  that  the  properties  of  the 
enzymes  themselves  can  influence  the  thermodynamics  of  reactions  by  influencing  the  local 
concentration  of  substrates  and/or  products.  The  chemical  character  and  availability  of  proton 
donors  and  acceptor  groups  in  the  active  site  and  the  oxidation-reduction  potential  of  active  site 
metal  and  accessory  clusters  can  essentially  tune  the  concentration  of  protons  and  electrons  and 
influence  the  thermodynamics  of  hydrogenase  reactions.  The  extent  of  this  influence  is  not  well 
understood  and  the  simplicity  of  the  hydrogenase  reaction  makes  this  an  ideal  model  system  for 
delineating  the  degree  of  this  control,  potentially  significantly  impacting  biotechnology 
particularly  in  the  area  of  biofuel  production. 

For  [NiFe]-  and  [FeFe]-hydrogenases,  as  detailed  in  this  review,  there  have  been  significant 
advances  in  our  understanding  of  nonprotein  ligand  biosynthesis  and  protein  maturation.  There 
is,  however,  still  much  more  work  to  be  done.  The  illumination  of  the  pathways  for  nonprotein 
ligand  biosynthesis  can  be  linked  to  a  classic  metabolic  pathway  elucidation  problem  in  which 
the  substrates  are  not  defined  and  as  such  present  significant  challenges  to  researchers. 
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Advances  over  the  last  decade  have  succeeded  in  identifying  the  substrates  for  carbon  monoxide 
and  cyanide  synthesis  but  there  are  still  a  lot  of  mechanistic  details  that  need  to  be  resolved.  For 
the  [NiFe]-hydrogenase  diatomic  ligand  biosynthesis,  the  existence  of  Fe-CCh  FTIR  bands  both 
in  HypC  alone  and  when  HypC  is  in  a  complex  with  HypD  has  led  to  the  recent  hypothesis  that 
HypC  delivers  an  Fe-CCE  group  to  HypD  where  the  CO2  is  reduced  to  CO.  Experimental 
validation  is  needed,  both  for  the  transfer  and  delivery  of  this  species  from  HypC  to  HypD,  as 
well  as  to  the  source  of  the  CO  ligand  being  CO2.  For  [FeFe]-hydrogenase  diatomic  ligand 
biosynthesis  the  presence  of  Fe-CO/CN  species  which  form  during  HydG  catalysis  leads  to  the 
hypothesis  that  the  C-terminal  cluster  is  cannibalized  in  this  process  and  that  the  iron-diatomic 
group(s)  are  then  mobilized  to  HydF.  Clearly  for  [FeFe]-hydrogenase  H  cluster  biosynthesis 
there  are  conflicting  results  and  differing  perspectives  on  many  aspects  of  the  mechanism  of  H 
cluster  assembly.  Some  of  these  likely  reflect  differences  in  experimental  design,  in  particular 
how  these  maturation  proteins  are  expressed  and  analyzed.  It  will  be  the  challenge  of  the  next 
generation  of  maturation  studies  to  delineate  the  relevant  insights  being  gleaned  for  all  the 
expected  approaches  in  the  context  of  physiological  or  in  vivo  assembly. 

The  culmination  of  the  new  insights  into  hydrogenase  diversity,  mechanism,  and 
maturation  presented  herein  should  have  one  convinced  of  the  amazing  qualities  of  hydrogenases 
as  complex  iron  sulfur  enzymes  that  span  evolutionary  time  and  which  played  a  key  role  in  the 
physiological  diversification  of  life.  These  enzymes  provide  a  one  stop  model  system  for  studies 
of  the  evolution  of  early  life  processes,  control  of  biological  electron  flow  in  metabolism,  proton 
coupled  electron  transfer  reactions  in  biological  systems,  complex  metal  cluster  assembly,  and 
the  origin  of  modern  day  aerobic  respiration. 
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Figure  1.  Distribution  of  [FeFe]-hydrogenase  (A)  and  [NiFe]-hydrogenase  mapped  on  a  “universal”  taxonomic  tree 
of  life.  Lineages  with  at  least  one  homolog  of  each  enzyme  are  overlaid  in  blue.  Parsimony  rules  were  used  to  map 
the  ancestral  character  of  extinct  lineages  (e.g.,  LUCA). 
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Figure  2.  Schematic  representation  of  the  structural  variation  of  [FeFe]-hydrogenase  homologs  present  in  sequence 
databases  as  adapted  from  Meyer,  2007  and  more  recently  Calusinska  et  al.,  2010.  Panel  A  illustrates  the  primary 
structural  classes  while  panel  B  illustrates  the  remarkable  structural  variation  in  within  the  M2  structural  subclass. 
Abbreviations:  CODH,  carbon  monoxide  dehydrogenase;  TRX:  Thioredoxin  domain;  NuoF:  homolog  subunit  F  of 
NADH  dehydrogenase;  OX:  oxidoreductase  domain;  NuoG:  homolog  of  subunit  G  ofNADH  dehydrogenase;  IDF: 
Indolepyruvate:ferredoxin  domain;  ACS:  Acetyl  CoA  dehydrogenase;  SLBB:  soluble-ligand-binding  b-grasp  fold 
domain;  RRR:  Rubredoxin-rubrerhythrin-rubredoxin  domain;  PAS:  PAS/P  AC  domain. 
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Figure  3.  Hypothetical  depiction  of  the  distribution  specific  groups  of  organisms  and  associated  [FeFe]-  and 
[NiFe]-hydrogenases  (designated  by  “G”)  that  function  to  couple  the  oxidation  or  reduction  of  substrates  along  a 
gradient  in  the  availability  of  oxidants.  (B)  Phylogenetic  reconstruction  of  representatives  of  [NiFe]-hydrogenase 
and  related  paralogs  (NuoBD,  MbxJL)  with  group  designations  indicated.  Figure  adapted  from  Boyd  et  ah,  2014. 
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Figure  4.  X-ray  structures  of  binuclear  cofactors  of  the  hydrogenase  catalytic  sites.  (A)  The  Ni-Fe  cluster  of 
[NiFe(Se)]-hydrogenases.  (B)  The  2Fe  subcluster  of  the  [FeFe]-hydrogenase  H  cluster.  Images  rendered  from  PDB 
codes  2FRV  ([NiFe]  hydrogenase)  [262]  and  3C8Y  ([FeFe]  hydrogenase  Cpl  from  Clostridium  pasteurianum).[263\ 
The  H-bonding  interactions  (dashed  lines,  black  font)  and  conserved,  exchangeable  groups  (dashed  lines,  purple 
font)  are  labeled.  Ni,  nickel;  Se,  selenium;  Fe,  iron. 
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Figure  5.  Model  of  the  H2  binding,  and  the  initial  activation  steps,  catalyzed  by  [NiFe(Se)]  hydrogenases.  Scheme 
(A)  proposed  by  Bruschi,  M.,  et  al.  [107].  Scheme  (B)  is  a  summary  of  activation  proposed  by  Wu  and  Hall  [108]. 
Green  signifies  electron  transfer  reactions. 
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Figure  6.  Model  for  the  H2  binding  and  activation  steps  by  [FeFe]-hydrogenases.  Two  alternative  schemes  are 
presented  based  on  DFT  and  spectroscopic  studies  of  [FeFe]-hydrogenases.  Scheme  (A),  the  resting  Flox  state  is 
assigned  as  a  [4Fe-4S]2h  subcluster  and  a  2Fe  subcluster  as  Fepn  /  Fej1  based  on  FTIR  and  EPR  spectra  and  DFT 
calculations  of  oxidized  Cp\  and  CrHydA  [120,  137,  139,  140],  and  involves  a  diferrous  2Fe  intermediate  for  H2 
binding  and  activation.  Scheme  (B),  the  resting  Hox  state  is  assigned  as  [4Fe-4S]21  and  2Fe  as  Fep'  /  Fej11  [104,  115, 
141,  144],  where  H2  binding  and  activation  occurs  on  a  mixed  valent  2Fe  at  the  Fe1  site.  Green  identifies 
intramolecular  ET  steps.  The  initial  PT  step  of  H2  activation  from  Fed  to  the  bridgehead  amine  ligand  is  shown  in 
red. 
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Figure  7.  Thiocyanate  biosynthesis  in  [NiFe]-hydrogenase 
maturation.  HypF  acts  on  carbamoylphosphate  (I), 
converting  it  first  to  carbamate  (II),  and  then  to 
carbamoyladenylate  (III).  The  carbamoyl  functional  group  is 
then  mobilized  to  a  cysteine  on  HypE’s  C-terminus  (IV), 
where  it  is  converted  to  thiocyanate  (V). 
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Figure  8.  Fe(CN)2CO  biosynthesis  and  large  subunit  processing  in  [NiFe]-hydrogenase 
maturation.  Thiocyanate  delivery  from  HypE  to  the  HypC-HypD  complex  results  in  HypC 
dissociation.  The  Fe(CN)2CO  unit  bound  to  HypD  (E.  coli  amino  acid  numbering)  is  then 
transferred  to  the  large,  precursor  subunit  of  [NiFe]-hydrogenase.  While  the  source  of  CO  is 
unresolved,  recent  biochemical  evidence  has  been  presented  showing  the  association  of  C02 
with  HypC  (see  main  text),  suggesting  that  HypC  delivers  Fe-C02  to  HypD  where  it  is 
reduced  to  CO.  Accordingly,  C02  is  shown  here  to  be  bound  to  the  HypC-HypD  complex. 
Following  insertion  of  Ni2+,  a  short  C-terminal  peptide  is  proteolytically  processed,  affording 
the  mature  [NiFe]-hydrogenase  enzyme. 
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Figure  9.  Maturation  scheme  detailing  hypothetic  2Fe  subcluster  biosynthesis  in  [FeFe]- 
hydrogenase  maturation.  HydG  reductive ly  cleaves  SAM  into  methionine  and  a  5’- 
deoxyadenosyl  radical  species;  following  H-atom  abstraction  from  the  para  phenolic 
position,  tyrosine  undergoes  heterolytic  Ca-C|3  bond  cleavage  to  generate  a  /; -ere so  late 
radical  and  dehydroglycine.  CO  and  CN  are  subsequently  formed  from  dehydroglycine  by 
an  unknown  mechanism  involving  the  site-differentiated,  C-terminal  [4Fe-4S]  cluster  of 
HydG.  Stopped-flow  FTIR  studies  have  shown  the  existence  of  Fe-CO-CN  species  that 
form  during  HydG  catalysis  (see  main  text),  leading  to  the  hypothesis  that  the  C-terminal 
cluster  is  cannibalized  during  biosynthesis.  Maturation  of  the  2Fe  subcluster  is  completed  via 
dithiomethylamine  synthesis  by  HydE.  HydF  acts  as  a  scaffold  or  carrier  protein  during  the 
assembly  process  and  transfers  the  2Fe  subcluster  to  HydA,EFG  to  achieve  [FeFe]- 
hydrogenase  activation. 
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[FeFe]  hydrogenases  catalyze  the  reversible  reduction  of  protons  to  yield  H2.  Attempts  to  use 
these  enzymes  in  scale  up  production  of  H2  or  other  reduced  bio-products  have  been  limited  by 
their  sensitivity  to  oxygen.  Here,  we  apply  a  PCR-directed  approach  to  determine  the 


distribution,  abundance,  and  diversity  of  hydA  gene  fragments  along  co-varying  salinity  and  02 

#  j,  J 

gradients  in  a  vertical  water  column  environment  of  Great  Salt  Lake,  Utah.  The  distribution  of 
hydA  was  constrained  to  water  column  transects  that  had  high  salt  and  reduced,  albeit  measurable 
02  concentrations.  The  presence  of  [FeFe]-hydrogenase  in  these  oxygen  containing  transects 
may  be  due  to  the  lower  solubility  and  bioavailability  of  02  in  saline  solutions.  Recovered  HydA 
protein  fragments  were  predominantly  bacterial  and  harbored  insertions  in  regions  of  the  protein 


that  fonn  gas  channels  that  lead  to  the  02-labile  FeS  cubane.  HydA  sequences  recovered  from 


GSL  were  enriched  in  hydrophilic  amino  acids  when  compared  to  HydA  from  less  saline 
environments,  suggesting  adaptation  to  minimize  salt-enhanced  hydrophobic  interactions  during 
protein  folding.  A  phylogenetic  framework  was  created  for  use  in  predicting  the  accessory 


cluster  composition  and  quaternary  structure  of  recovered  HydA  protein  sequences.  HydA 
recovered  from  the  GSL  water  column  are  predicted  to  harbor  numerous  N-  and  C-tenninal 
accessory  iron-sulfur  cluster  binding  domains  and  form  trimers;  characteristics  typical  of 


bifurcating  hydrogenase.  This  study  indicates  an  important  role  for  H2-based  bifurcating  [FeFe]- 
hydrogenases  in  the  functioning  of  the  GSL  ecosystem  and  provides  new  target  enzymes  and 
variants  for  use  in  identifying  an  02  tolerant  enzymes  for  use  in  biotechnological  applications. 
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INTRODUCTION 


Hydrogen  (H2)  is  a  diffusible  electron  carrier,  has  the  highest  energy  content  per  unit 
mass  of  all  naturally  occurring  fuels,  and  forms  the  basis  of  numerous  interspecies  interactions  in 
natural  microbial  communities  {Mclnerney,  2009  #212;Schink,  1997  #82}.  [NiFe]-  and  [FeFej- 


nncma] 


rer  e 


hydrogenase,  which  differ  in  the  metal  composition  of  their  active  site  cluster,  are  principally 
responsible  for  H2  cycling  in  natural  environments.  In  general,  [FeFej-hydrogenases  are  found 
in  anaerobic  bacteria  and  are  especially  prevalent  among  the  fermentative  organisms  (e.g., 
firmicutes)  where  they  typically  function  in  the  regeneration  of  reduced  electron  carriers 


[NAD(P)H,  ferredoxin  (Fd)]  coupled  to  the  reduction  of  protons.  They  are  also  found  in  a 

number  of  lower  eukaryotes  including  algae  and  protists  but  surprisingly  they  have  yet  to  be 

J  ^ 

found  in  cyanobacteria  or  in  archaea  {Meyer,  2007  #7}.  In  contrast,  [NiFej-hydrogenases,  are 
frequently  found  in  archaea,  in  addition  to  their  common  occurrence  in  a  large  number  of 
bacteria  {Vignais,  2007  #16;Vignais,  2001  #8}. 

V 


V 


The  majority  of  H2  in  natural  environments  is  produced  through  fermentative  processes 
{Conrad,  1996  #218}.  While  both  classes  of  hydrogenase  can  function  in  the  generation  of  H2 
during  fermentation  {Meyer,  2007  #7;Vignais,  2007  #16;Vignais,  2001  #8},  [FeFe] -hydrogenase 
are  generally  more  efficient  H2  producing  catalysts  {Cammack,  1999  #213}  and  such  have  been 
subjected  to  extensive  biochemical  and  structural  characterization  in  an  effort  to  optimize  these 
enzymes  for  efficient  bio-hydrogen  production  {Mulder,  2010  #214;Peters,  1998  #101;Posewitz, 
2008  #12;Stapleton,  2010  #230; Stapleton,  2010  #231}.  The  best  characterized  [FeFej- 
hydrogenases  are  monomeric,  Fd-dependent  enzymes  {Meyer,  2007  #7;Vignais,  2001  #8}. 
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Examples  of  these  enzymes  include  those  from  eukaryotic  algae  such  as  Chlamydomonas 
reinhardtii  {Mulder,  2010  #214;Meuser,  2012  #215;Stripp,  2009  #216}  and  clostridial  species 
such  as  Clostridium  pasteurianum  {Peters,  1998  #101}.  Whereas  C.  reinhardtii  [FeFe]- 
hydrogenases  comprise  only  the  active  site  H-cluster,  [FeFe]-hydrogenase  from  C. 
pasteurianum  contains  an  additional  three  [4Fe-4S]  cluster  binding  domains  in  the  N  terminus 
which  may  function  to  allow  coupling  with  electron  donors  with  differing  redox  potentials. 

#  j,  J 

Characterization  of  [FeFe]-hydrogenase  diversity  in  genome  sequences  {Calusinska,  2010 
#210;Meyer,  2007  #7}  noted  substantial  variation  in  N-  and  C-tenninal  Fe-S  cluster  and 

s'O' 

accessory  co-factor  binding  motifs  (Fig.  3),  suggesting  potential  interactions  with  a  variety  of 
redox  partners.  In  addition,  such  analyses  identified  differences  in  [FeFeJ-hydrogenase 
quaternary  structure,  including  the  identification  of  multimeric  homologs  that  likely  form  dimers 


trimers,  or  tetramers  {Calusinska,  2010  #210;Meyer,  2007  #7}. 


i  munmieric  i 

J7JS 

pr  7007  ttll 


. 


Biochemical  characterization  of  multimeric  [FeFe]-hydrogenase  enzymes  indicate  that 
they  are  pyridine  nucleotide  (NAD(P)Fl-  or  NADH)  linked  H2  producing  enzymes  {Malki,  1995 
#219;Soboh,  2004  #220;Verhagen,  2001  #221;Malki,  1995  #219;Soboh,  2004  #220;Verhagen, 


2001  #221}.  The  trimeric  hydrogenase  from  T.  maritima  was  shown  to  require  both  reduced  Fd 
and  NADH  for,efficient  H2  production  {Schut,  2009  #222}.  Conversely,  the  [FeFej- 
hydrogenase  from  Acetobacterium  woodii  simultaneously  reduces  NAD+  and  ferredoxin  (Fd) 
during  H2  oxidation.  The  key  to  the  reversible  oxidation  of  H2  and  simultaneous  NAD+  and  Fd 
reduction  is  the  coupling  of  their  energetics;  the  exergonic  reduction  of  NAD +  by  electrons 
derived  from  H2  allows  the  endergonic  reduction  of  Fd  in  a  process  now  termed  electron 
bifurcation  {Herrmann,  2008  #223;Buckel,  2013  #224}.  Thus,  these  enzymes  function  to  create 


4 


electrons  with  a  very  low  redox  potential  by  simultaneously  creating  an  electron  with  a  high 
redox  potential  {Buckel,  2013  #224;Wang,  2013  #225;Wang,  2013  #226;Schut,  2009  #222}. 


While  the  aforementioned  examples  indicate  a  directionality  in  the  bifurcating  process,  [FeFe]- 
hydrogenases  that  bifurcate  can  in  principle  function  in  vivo  in  both  the  production  and  oxidation 
of  H2,  as  was  recently  shown  for  a  bifurcating  hydrogenase  in  Moorella  thermoacetica  {Wang, 
2013  #226}.  During  growth  on  glucose,  M.  thermoacetica  can  intermittingly  produce  H2  from 

♦  u 

Fd  and  NADH  via  the  bifurcating  hydrogenase.  Likewise,  this  same  hydrogenase  can  utilize 
produced  H2  and  C02  to  prefonn  acetogenesis  generating  3  mol  of  acetate  per  glucose.  Although 

X  ^ 

M.  thermoacetica  does  not  grow  well  autotrophically  on  H2  and  C02,  it  uses  the  bifurcating 

* 

hydrogenase  to  produce  reduced  Fd  to  drive  acetogenesis.  The  reversible  production  of  such  low 


potential  metabolic  electrons  through  the  process  of  H2  dependent  electron  bifurcation  is  of 

is i  ^ 

significant  biotechnological  interest  since  they  could  be  modulated  in  vivo  and  directed  toward 
the  controlled  production  of  highly  reduced  biofuel  products. 


117  Multimeric  bifurcating  [FeFe] -hydrogenase  complexes  now  appear  to  be  commonplace 

118  among  anaerobic  bacteria  {Calusinska,  2010  #210;Meyer,  2007  #7}.  However,  like  the 

119  monomeric  forms  of  [FeFe] -hydrogenase,  multimeric  enzymes  are  oxygen  sensitive  which 

120  presents  challenges  in  the  application  of  such  enzymes  in  large  scale  production  efforts 

121  {Juszczak,  1991  #232}.  Numerous  biochemical  and  structural  studies  have  been  conducted  to 

122  shed  light  on  the  basis  of  the  oxygen  sensitivity  of  [FeFe]-hydrogenase.  One  approach  to  address 

123  this  problem  is  gene  shuffling  or  randomized  mutation,  which  generates  a  diverse  recombinant 

124  hydrogenase  library  to  screen  for  enhanced  02  tolerance  and/or  stability  {Nagy,  2007 

125  #5;Stapleton,  2010  #230; Stapleton,  2010  #231;Bingham,  2012  #235}.  Another  approach  is  to 
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search  natural  diversity  of  these  enzymes  as  they  are  distributed  across  geochemical  gradients 
(e.g.,  O2)  through  the  use  of  PCR  or  metagenomic-directed  approaches  {Boyd,  2010  #191;Boyd, 
2009  #108;Schmidt,  2010  #233;Chivian,  2008  #144;Wamecke,  2007  #147}.  A  previous 
application  of  a  PCR-directed  approach  targeting  a  fragment  of  the  gene  encoding  the  large 
subunit  ( HydA )  of  [FeFej-hydrogenase  in  a  phototrophic  mat  community  identified  a  number  of 

v 

substitutions  and  insertions  in  regions  of  the  protein  that  coordinate  the  O2  labile  active  site  FeS 

♦  u 

cubane  {Boyd,  2009  #108}.  While  the  functional  implications  of  these  substitutions/insertions 
have  yet  to  be  followed  up  on  biochemically,  such  findings  provide  impetus  to  continue 

/  C/ 

searching  natural  systems  for  potential  variants  capable  of  tolerating  higher  O?  concentrations. 

< 2 7 

Due  to  the  limited  supply  of  freshwater  any  large  scale  up  production  of  biofuels  is  likely 
to  employ  salt  water,  creating  the  need  to  identify  salt  tolerant  enzymes  and  organisms  in 
addition  to  enzymes  that  also  function  in  the  presence  of  O2  {Beer,  2009  #126}.  Since  both  the 
solubility  and  the  diffusion  rate  of  O2  are  lower  in  (hyper)saline  solutions  {Lange,  1972  #237},  it 
is  possible  that  salt  tolerant  [FeFe]-hydrogenase  may  maintain  functionality  in  the  presence  of 
higher  apparent  concentrations  of  O2  because  cells  are  able  to  maintain  a  lower  effective 


(bioavailable)  intracellular  concentration.  Here,  we  apply  a  PCR-directed  approach  to  detennine 
the  distributioi^and  diversity  of  hyclA  gene  fragments  in  a  vertical  water  column  environment  of 
Great  Salt  Lake,  Utah  which  exhibits  strong  and  co-varying  gradients  in  salt,  oxygen, 
photosynthetically  active  radiation,  pH,  and  temperature  {Meuser,  2013  #209}.  A  phylogenetic 
framework  was  created  for  use  in  predicting  the  accessory  cluster  composition  and  quaternary 
structure  of  HydA  protein  sequences  obtained  along  the  vertical  gradient.  The  results  indicate  an 
abundance  of  multimeric  [FeFej-hydrogenase  that  are  putatively  involved  in  electron  bifurcating 
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processes.  Recovered  HydA  protein  fragments  revealed  evidence  of  adaptation  to  elevated  salt 
concentrations  and  harbored  variations  in  regions  of  the  protein  that  fonn  gas  channels  leading  to 
the  02-labile  FeS  cubane  of  the  active  site.  Similarly,  novel  substitutions  were  identified  in 
residues  that  coordinate  the  Cfi  labile  active  site.  This  study  demonstrates  that  enzyme  variants 
with  desired  properties  can  potentially  be  recovered  by  examining  protein  diversity  in  microbial 
assemblages  that  have  evolved  in  the  presence  of  a  particular  environmental  stress. 


METHODS 


a, 


to 


tress. 

\V 

o 


- _ robia 

V 


Site  Description  and  Sample  Collection.  Samples  were  collected  from  the  Division  of 
Wildlife  and  Resources  sample  site  number  3  (DWR3:  Latitude  41.16746,  Longitude  - 
1 12.66961 17)  in  June,  2007  (surface,  1M,  4M,  6M,  6.5M,  and  8M  depths  and  benthic 
sediments),  by  lowering  a  2L  Kemmerer  collection  bottle  (Wildlife  Supply,  Yulee,  FL,  USA)  to 
each  depth  sampled,  with  deionized  water  rinses  between  each  sampling.  Samples  were 
transferred  to  sterile  collection  bottles  leaving  minimal  headspace  volume,  stored  on  ice,  and 
transported  to  the  laboratory.  Cells  were  then  pelleted  by  centrifugation  (4000  g  for  15  min  at 
3°C  )  using  a  Sorvall  centrifuge  (Thenno  Fisher,  Waltham,  MA,  USA),  the  supernatant  decanted, 
and  biomass  pellets  resuspended  in  1  mL  of  supernatant  and  stored  at  -80°C  until  used  for  DNA 
extraction. 


Physical  and  chemical  analysis.  Chemical  and  physical  measurements  were  compiled 
as  a  part  of  a  previous  study  conducted  on  the  DWR3  water  column  {Meuser,  2013  #209}. 
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Briefly,  quantum  scalar  irradiance  of  total  available  photo  synthetic  active  radiation  (PAR,  400- 
700  nm  photons-m '2-s'1)  was  determined  using  a  Li-Cor  LI- 193  underwater  spherical  quantum 
sensor  and  LI-250A  light  meter  (Li-Cor  Biosciences,  Lincoln,  Nebraska,  USA).  A  Troll  9500 
multi-sensor  (In-Situ  Inc.,  Fort  Collins,  CO,  USA)  was  used  to  simultaneously  acquire  pH, 
temperature,  salinity,  and  dissolved  O2  data  at  each  DWR3  site  sampling  depth. 

DNA  Extraction,  Amplification,  Cloning,  and  Sequencing  of  hydA. 

extracted  from  ~  200  mg  aliquots  of  biomass  using  the  PowerSoil™  DNA  Isolation  Kit  (MoBio 
Inc.,  Carlsbad,  CA,  USA)  following  the  manufacturer’s  instructions.  DNA  was  quantified 
fluorometrically  as  previously  described  {Boyd,  2007  #107}.  All  DNA  extracts  were  screened 
for  the  presence  of  16S  rRNA  genes  to  ensure  the  presence  of  PCR-amplifiable  DNA  using  10 
ng  of  DNA  and  primers  bacterial  1070F  (5’-ATGGCTGTCGTCAGCT-3’)  and  universal  1492R 
(5’-GGTTACCTTGTTACGACTT-3’)  {Boyd,  2009  #142}.  Approximately  500  bp  fragments  of 
hydA  were  PCR-amplified  in  triplicate  from  10  ng  of  environmental  genomic  DNA  as  template 
and  primer  pair  FeFe-272F  and  FeFe-427R  using  previously  established  reagent  concentrations 
and  reaction  conditions  {Boyd,  2009  #108}.  Equal  volumes  of  each  replicate  amplification  were 
pooled  and  were  purified  using  the  Wizard  PCR  Preps  DNA  purification  system  (Promega, 
Madison,  WI),  quantified  using  the  Low  DNA  Mass  Ladder  (Invitrogen,  Carlsbad,  CA),  cloned 
using  the  pGem-T  Easy  Vector  System  (Promega),  and  sequenced  using  the  M13F-M13R  primer 
pair  as  previously  described  {Boyd,  2009  #142}.  A  total  of  215  hydA  gene  sequences  were 
sequenced  in  the  present  study  and  representatives  of  each  phylotype  (supplemental  table  1)  have 
been  deposited  in  the  GenBank,  DDBJ,  and  EMBL  databases  under  the  accession  numbers 
HM636647-HM636798. 
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Primary  Sequence  Analysis.  MEGA  (ver.  4.0.1)  {Tamura,  2007  #88}  was  used  to 
translate  and  alisng  hydA  sequences  specifying  the  Gonnet  250  protein  weight  matrix  with  a 
pairwise  alignment  gap  opening  penalty  of  13  and  gap  extension  penalty  of  0.05.  The  aligned 
HydA  sequences  were  screened  for  the  presence  of  the  LI  ([FLI]TSC[C/S]P[GAS]W[VIQH]) 
and  L2  ([IVLF]MPCx[ASRD]K[KQ]xE)  (conserved  residues  are  in  bold  and  underlined,  and 


bracketed  positions  indicate  “semiconserved”  residues  at  that  position)  as  outlined  previously 
{Meyer,  2007  #7;Vignais,  2001  #8}.  ClustalX  {Larkin,  2007  #89}  was  also  used  to  create  a 
pairwise  sequence  identity  matrix  which  was  subsequently  imported  into  DOTUR  {Schloss, 
2005  #36}  to  identify  and  group  operational  taxonomic  units  (OTUs)  and  to  perform  rarefaction 
analyses  at  a  sequence  identity  threshold  of  0.01.  Indices  describing  the  hydrophobicity  of 
inferred  protein  sequences  (aliphatic  and  grand  average  of  hydropathicity  [GRAVY]  indices) 
were  computed  using  the  ProtParam  tool  available  on  the  ExPASy  proteomics  server 
(http://au.expasy.org/tools/protparam.html)  {Gasteiger,  2005  #35}.  The  aliphatic  index  is  the 
relative  volume  occupied  by  aliphatic  amino  acid  side  chains  (alanine,  valine,  isoleucine,  and 
leucine)  in  a  given  protein  or  protein  fragment  and  is  calculated  according  to  the  formula  X(Ala) 
+  a  *  X(Val)  +  b  *  (  X(Ile)  +  X(Leu)  )  where  X(Ala),  X(Val),  X(Ile),  and  X(Leu)  are  mole 
percent  (100  *  mole  fraction)  of  alanine,  valine,  isoleucine,  and  leucine  and  a  and  b  are  the 


relative  volume  coefficients  for  valine  (a=2.9)  and  Leu/Ile  (b=3.9)  relative  to  alanine  {Kyte, 
1982  #37},  with  higher  indices  indicative  of  a  higher  content  of  aliphatic  side  chains  in  the 
protein.  The  GRAVY  index  is  the  sum  of  the  hydropathy  values  for  each  amino  acid  in  a 
sequence  divided  by  the  number  of  residues  {Kyte,  1982  #37},  with  positive  GRAVY  indices 
indicative  of  hydrophobicity  and  negative  GRAVY  indices  indicative  of  hydrophilicity.  For 
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comparison,  aliphatic  and  GRAVY  indices  were  computed  for  putative  HydA  previously 
recovered  from  microbial  mats  inhabiting  salterns  from  Guerrero  Negro  (GN),  Mexico 
(GenBank:  FJ623894-FJ623958)  {Boyd,  2009  #108}  and  from  geothermal  springs  in 
Yellowstone  National  Park  (YNP),  Wyoming  (GenBank:  GU362773-GU362867)  {Boyd,  2010 


#191}  (Supplemental  Table  2).  Intervals  of  0.04  GRAVY  index  units  were  empirically  selected 

for  use  in  binning  GRAVY  indices  for  sequences  recovered  from  the  three  gcochcmicall}% 

♦  j,  J 

distinct  environments  (GSL,  YNP,  GN).  Indices  were  rounded  downward  to  the  nearest  0.04 
unit  interval  prior  to  binning.  P- values  were  obtained  by  using  a  two-tailed  Student  t  test. 


luster  Coi 


Prediction  of  Environmental  HydA  Accessary  Cluster  Composition.  The  sequences 

used  to  develop  the  F-  and  C-cluster  classification  and  tertiary  structure  scheme  developed  by 

J 

Meyer  2007  and  further  augmented  by  Calusinska  et  al.,  2010  {Calusinska,  2010  #210}  were 
compiled  and  used  to  create  a  database  for  use  in  predicting  the  accessory  cluster  composition 
and  tertiary  structure  of  putative  HydA  fragments  recovered  from  the  GSL  water  column  (Supp. 
Table  3).  Additional  sequences  representing  the  closest  BLASTp  hits  for  each  OTU  were  also 


compiled  and  characterized  for  F-  and  C-cluster  composition  and  tertiary  structure  scheme  using 
the  criteria  of  Calusinska  et  al.,  2010  {Calusinska,  2010  #210}  and  the  Conserved  Domain 
Database  as  implemented  with  BLASTp  {Marchler-Bauer,  2009  #211}.  The  classification  of 
these  proteins,  along  with  those  reported  by  Calusinska  et  al.,  2010,  are  reported  in  Supplemental 
Table  3  and  a  schematic  illustrating  their  F-  and  C-cluster  structural  variation  is  presented  in  Fig. 


3. 
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HydA  protein  sequences  from  GSL  and  from  our  database  were  aligned,  trimmed  to  the 
length  of  the  amplified  protein  fragment  (positions  272  to  427  in  HydA  from  Clostridium 
pasteurianum),  and  subjected  to  Maximum  Likelihood  phylogenetic  reconstruction  as  described 
previously  {Boyd,  2010  #191}.  The  C-  and  F-cluster  composition  and  tertiary  structure  of 
environmental  HydA  was  then  predicted  by  phylogenetic  clustering  with  HydA  comprising  our 


LMIIg  UUI 

V 


database.  Sequences  that  formed  lineages  without  a  representative  from  our  database  were 

♦  u 

classified  as  “unknown”  with  respect  to  F-  and  C-cluster  composition  and  tertiary  structure. 


hydA  qPCR.  Quantitative  PCR  (qPCR)  was  used  to  estimate  the  number  of  hydA 
templates  in  GSL  DNA  extracts  according  to  previously  described  methods  {Boyd,  2011  #246}. 
Standard  curves  were  generated  from  plasmid  DNA  containing  hydA  amplified  from  the  GSL 
sample  sites.  Three  plasmid  clones  were  used  in  generating  a  standard  curve  that  relates  template 


copy  number  to  threshold  qPCR  amplification  signal.  The  threshold  amplification  signal  as  a 


function  of  copy  number  varied  by  less  than  1  cycle  for  each  of  the  three  plasmid  clones;  thus, 
the  standard  curves  generated  using  each  plasmid  clone  were  averaged  for  use  in  calculating  the 
average  template  abundances  and  standard  deviation  in  template  abundances  from  replicate 


qPCRs.  A  standard  curve  was  generated  over  6  orders  of  magnitude  ranging  from  9.1  x  101  to 
1.1  xio7  copies  of  template  per  assay  (R2  =  0.995). 

O' 

qPCR  assays  were  performed  in  a  Rotor-Gene  300  quantitative  real-time  PCR  machine 
(Qiagen,  Valencia,  CA)  in  0.5  mL  optically  clear  PCR  tubes  (Qiagen,  Valencia,  CA)  using  a 
SsoFast™  EvaGreen  Supermix  qPCR  Kit  (Bio-Rad  Laboratories,  Hercules,  CA).  Assay 
reactions  were  amended  to  a  final  concentration  of  0.4  mg  ml"1  molecular-grade  bovine  serum 
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albumin  (Roche,  Indianapolis,  IN).  qPCR  cycling  conditions  were  as  follows:  initial  denaturation 
(95°C  for  10  min)  followed  by  40  cycles  of  denaturation  (95°C  for  10s),  annealing  (56.5°C  for 
15s),  and  extension  (72°C  for  20  s).  Specificity  of  the  qPCR  assays  was  verified  by  melt  curve 
analysis.  The  reported  template  abundances  are  the  average  and  standard  deviation  of  qPCR 
assays  performed  in  triplicate  for  each  gene. 


.<7 


HydA  Phylogenetic  Diversity.  The  phylogenetic  position  of  hydA  homologs  was 
assessed  using  PhyML-aBayes  {Anisimova  M.,  2011  #204}  with  the  nuclear  prelamin  A 


-tear  pr 
related 


recognition  factor-like  protein-encoding  gene  (XM_4 14836),  a  distantly  related  homolog  of 
hydA  {Meyer,  2007  #7},  from  Gallus  gallus  serving  as  the  outgroup.  The  General  Time 
Reversible  (GTR)  substitution  model  with  gamma-shaped  rate  variation  with  a  proportion  of 
invariable  sites  was  specified  in  the  calculation,  as  recommended  by  Modeltest  (ver.  3.)  {Posada, 
2006  #189}.  The  nuclear  prelamin  A  recognition  factor-like  protein-encoding  gene 
(XM_4 14836),  a  distantly  related  homolog  of  hydA  {Meyer,  2007  #7},  from  Gallus  gallus 
served  as  the  outgroup.  The  consensus  phylogram  was  rate-smoothed  using  the 
multidimensional  version  of  Rambaut’s  parameterization  as  implemented  in  PAUP  (ver.  4.0) 


{Swofford,  2001  #94}.  Rate-smoothing  was  performed  according  to  the  parameters  identified 
using  Modeltest.  This  included  the  identification  of  the  substitution  model,  the  gamma 
distribution  of  rate  variation  across  sites,  the  proportion  of  invariant  sites,  nucleobase 
frequencies,  and  the  rate  matrix  for  each  phylogram.  Phylocom  (ver.  4.0. 1)  {Webb,  2008  #1 14} 
was  used  to  calculate  Faith’s  index  of  phylogenetic  diversity  (PD)  using  the  rate  smoothed 
Bayesian  chronogram.  PD  is  the  proportion  of  total  branch  length  in  the  phylogeny  associated 
with  the  taxa  in  a  given  sample  or  assemblage.  A  higher  PD  index  for  an  assemblage  is 
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indicative  of  higher  phylogenetic  diversity  (phylogenetic  richness)  relative  to  the  total  sequence 
pool. 


RESULTS 


Water  column  chemistry.  Strong  and  often  co-varying  physical  and  chemical  gradients 

#  j,  J 

were  observed  in  the  DWR3  vertical  water  column  (Table  1).  Salinity  did  not  vary  substantially 
over  the  interval  from  the  surface  to  a  depth  of  6.0  meters  but  increased  from  126  ppt  at  6.0 
meters  to  203  and  247  ppt  at  depths  of  6.5  and  8.0  meters,  respectively.  Likewise,  water  column 


temperature  was  nearly  constant  (23-24°C)  over  the  0.0  to  6.0  meter  vertical  transect,  but 

■s  (zu  and  17°C 

presence  of  an  inverted  thermocline  between  depth  intervals  of  6.0  and  8.0  meters.  pH  varied 


decreased  abruptly  at  a  depth  of  6.5  and  8.0  meters  (20  and  17°C,  respectively),  indicating  the 


from  5.95  to  8.05  over  the  eight  meter  depth,  with  the  highest  pH  observed  at  a  depth  of  6.0 
meters  and  the  lowest  pH  observed  at  8.0  meters.  Photosynthetically  active  radiation  (PAR) 
decreased  systematically  from  1800  pM  at  the  surface  to  175  uM  at  a  depth  of  4.0  meters  and 
was  below  detection  at  a  depth  of  8.0  meters.  Dissolved  oxygen  varied  little  over  the  0.0  to  4.0 


meter  vertical  transect,  but  decreased  substantially  from  5.13  mg  L'1  to  2.3  mg  L’1  over  the  4.0  to 
6.0  meter  depth  transect. 

Abundance,  composition,  and  diversity  of  hydA  in  GSL  water  column.  hydA 

amplicons  were  not  detected  in  surface  samples  (0.0  meters)  or  from  samples  collected  from 
depths  of  1.0  or  4.0  meters.  In  contrast,  hydA  genes  were  detected  in  samples  collected  from 
depth  intervals  of  6.0,  6.5,  and  8.0  meters,  as  well  as  from  benthic  sediment  samples  collected  at 
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-  8.5  meters.  The  relative  abundance  of  hydA  templates,  when  normalized  to  total  quantity  of 
extractable  DNA,  increased  with  increasing  depth  with  the  highest  concentration  of  hydA  genes 
detected  in  associated  with  benthic  sediment  biomass  (9.2±1.9  x  10'^  templates  /  ng  DNA).  The 
distribution  of  hydA  along  the  DWR3  water  column  is  coincident  with  the  position  of  the 
thennocline  and  with  decreasing  levels  of  PAR  and  dissolved  O?  and  increasing  salinity.  ,;Jhe 


abundance  of  hydA  templates  was  significantly  and  inversely  correlated  with  temperature 

#  C  J 

(Pearson  R  =  -0.96,  P  <  0.01)  and  dissolved  oxygen  (Pearson  R  =  -0.85,  P  =  0.03)  and  was 


y.  The 
ire 


positively  correlated  with  salinity  (Pearson  R  =  0.93,  P  <  0.01). 


to 


=  0.03) 


A  total  of  21 1  hydA  sequences  (47  to  58  clones  per  sample)  that,  when  translated, 

harbored  signature  motifs  characteristic  of  HydA  {Meyer,  2007  #7}  were  obtained  in  the  current 

J  ^ 

study  (Table  2).  The  average  hydropathy  of  amino  acids  (GRAVY  index)  comprising  the 
inferred  protein  sequences  was  calculated  and  compared  to  sequences  obtained  with  the  same 
primer  sets  from  other  environmental  systems,  including  microbial  mat  communities  inhabiting 
salterns  at  Guerrero  Negro  (GN),  Mexico  {Boyd,  2009  #108}  and  hot  springs  in  Yellowstone 
National  Park  (YNP),  USA  {Boyd,  2010  #191}.  The  GRAVY  index  for  HydA  sequences 


obtained  from  the  GSL  DWR3  water  column  spanned  values  ranging  from  0.04  indicative  of 
average  hydrophobic  amino  acid  content  to  -0.56  which  indicates  a  very  low  hydrophobic  amino 
acid  content.  There  was  no  discemable  pattern  in  the  average  GRAVY  index  for  HydA 
sequences  obtained  from  the  GSL  with  sampling  depth  (data  not  shown).  However,  the  average 
GRAVY  index  for  sequences  obtained  from  the  GSL  water  column  (14.9  to  24.7%  salinity)  was 
-0.30  whereas  average  GRAVY  indices  obtained  GN  (8%  salt)  and  YNP  (0.5  to  3.1%  salinity) 
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were  -0.26  and  -0. 15,  respectively.  This  indicates  that  HydA  assemblages  from  GSL  on  average 
comprise  a  greater  number  of  hydrophobic  amino  acids  than  those  from  GN  and  YNP. 


Faith’s  index  of  phylogenetic  diversity  (PD),  a  B-diversity  metric  that  quantifies  the 
proportion  of  total  branch  length  in  the  phylogeny  associated  with  sequences  obtained  from  a 


irom  a 


given  environment  when  compared  to  all  environments  when  considered  together,  increased 

♦  u 

systematically  with  increasing  depth  (Table  2).  hvdA  deduced  amino  acid  sequences  recovered 
from  the  four  GSL  environments  were  distantly  related  to  HydA  from  cultivated  organisms,  with 


Luvait 

a 


the  average,  minimum,  and  maximum  observed  sequence  identities  of  52%,  64%,  and  76% 
(Supp.  Table  1).  The  majority  (99.6%)  of  the  sequences  obtained  from  the  4  GSL  environments 
exhibited  affiliation  with  bacteria,  although  a  single  sequence  affiliated  (62%  identical)  with  a 
eukaryotic  diatom  Thalassiosira  pseudonana  was  recovered  from  the  benthic  sediments.  Despite 
significant  variation  in  the  taxonomic  affiliation  of  HydA  assemblages  sampled  at  depth,  binning 
the  sequences  at  the  class  level  of  taxonomic  resolution  failed  to  uncover  discernable  patterns  in 
the  taxonomic  composition  of  assemblages  as  a  function  of  depth  (Fig.  IB),  consistent  with 
previous  studies  indicating  the  tendency  for  hydA  to  be  horizontally  transferred  {Schmidt,  2010 


ifA, 


#233}.  HydA  affiliated  with  Clostridia  were  the  most  abundant  regardless  of  the  sampling 

& 


depth. 


^^Vari 


Variation  in  H-CIuster  Binding  Motifs  and  Putative  Gas  Channels.  A  variety  of 


substitutions  were  noted  in  the  LI  motif,  including  a  Cys  to  Ser  substitution,  relative  to  position 
300  of  the  Clostridium  pasteurianum  (Cpl)  HydA  sequence  (AAA23248).  This  residue  is 
involved  in  coordinating  the  active  site  H-cluster  {Peters,  1998  #101}.  Interestingly,  a  number 
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of  substitutions  were  also  observed  in  residues  that  line  a  putative  gas  channel  that  leads  to  the 
oxygen  sensitive  active  site  H  cluster  {Cohen,  2005  #234}.  In  particular,  substitution  of  the 
much  larger  Phe  for  Lys  and  lie  (positions  283  and  287  of  Cpl,  respectively)  were  noted  in 
several  GSL  sequences.  Conversely,  other  proteins  exhibited  substitution  of  Lys  for  Phe 
(position  293  of  Cpl).  Intriguingly,  a  large  insertion  just  upstream  from  the  LI  motif  was 

observed  in  a  number  of  sequences  recovered  from  GSL.  These  insertions  occur  in  a  region  of 

#  j,  J 

HydA  that  fonns  the  loop  that  caps  one  of  the  putative  gas  channels  {Cohen,  2005  #234}. 


Similar  insertions  are  common  in  phototrophic  algae  {Beer,  2009  #126} 


'O 


Inferred  Structural  Variation  of  GSL  HydA.  A  database  of  HydA  sequences  (Supp. 

Table  3)  was  compiled  from  complete  genome  sequences  present  in  GenBank  based  on  previous 

J  ^ 

characterizations  of  their  N-  and  C-terminal  cluster  composition  (Fig.  4)  and  quaternary  structure 
{Calusinska,  2010  #210;Meyer,  2007  #7}.  Additional  sequences  were  obtained  based  on 
BLASTp  searches  and  the  primary  sequence  and  quaternary  structural  characteristics  were 
determined  as  described  previously  {Calusinska,  2010  #210}.  These  sequences  and  the  modular 
composition  of  the  N-  and  C-terminal  domains  (i.e.,  F-  and  C-clusters)  and  their  quaternary 


structure  was  used  to  develop  a  Bayesian  phylogenetic  framework  for  predicting  these  features 
based  on  the  hydA  fragment  obtained  from  GSL.  Similar  to  previous  phylogenetic  analyses  of 
the  H-cluster  domain  {Calusinska,  2010  #210;Meyer,  2007  #7},  phylogenetic  reconstruction  of 
just  the  H-cluster  fragment  (positions  280  to  419  of  HydAl  from  Cpl)  in  sequences  recovered 
from  GenBank  revealed  clustering  that  corresponded  to  primary  and  quaternary  structural 
characteristics  (Fig.  5)  indicating  that  the  phylogenetic  signal  of  the  fragment  was  sufficient  for 
predicting  the  characteristics  of  HydA  from  GSL  at  these  levels  of  consideration.  The  majority 
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377  (93.8%  of  total  sequences)  of  HydA  homologs  recovered  from  GSL  were  nested  within  or 

378  formed  clusters  with  reference  sequences  that  have  the  M3  architecture  and  that  form  trimers 

379  (TRM3)  (Supp.  Table  1).  Only  a  small  fraction  of  sequences  inferred  to  have  a  thermotogae 

380  monomeric  configuration  (5.3%  of  total  sequences);  there  was  no  discernable  pattern  in  the 


distribution  of  these  sequences  as  a  function  of  depth  in  the  DWR3  water  column  (data  noj^ 

shown).  In  addition,  0.88%  of  the  sequences  did  not  cluster  or  nest  phylogenetically  with 

#  j,  J 

sequences  comprising  our  database  preventing  prediction  of  the  F-  and  C-cluster  composition  or 
quaternary  structure.  These  sequences  generally  had  low  sequence  identities. 


DISCUSSION 


<7 


e- 


A  diversity  of  [FeFej-hydrogenase  were  identified  along  the  DWR3  vertical  gradient  in 
the  GSL.  While  the  phylogenetic  framework  developed  here  indicates  that  the  majority  of  these 


sequences  are  trimeric  and  are  possibly  involved  in  FL-based  electron  bifurcation,  it  is  unclear  if 
they  are  involved  in  the  production  or  consumption  of  FL.  Examples  of  bifurcating  [FeFe]- 


392  hydrogenase  involved  in  oxidation  and  production  of  FL  have  been  described.  For  example,  an 

393  enzyme  that  functions  to  bifurcate  electrons  derived  from  the  oxidation  of  FL  coupled  to  the 

394  reduction  of  Fd  and  NAD+  has  been  identified  in  Acetobacterium  woodii  {Schuchmann,  2012 

395  #228}  while  other  enzymes  that  function  to  couple  the  oxidation  of  Fd  and  NADH  to  the 

396  formation  of  FL  have  been  identified  in  Thermotoga  maritima,  Thermoanaerobacterium 

397  saccharolyticum,  and  Thermoanaerobacter  tengcongensis  {Schut,  2009  #222;Soboh,  2004 


398  #102;Shaw,  2009  #236}.  Moreover,  two  recent  studies  reveal  the  presence  of  reversible 
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bifurcating  complexes.  In  the  case  of  the  acetogen  Moorella  thermoacetica,  a  reversible  Fd-  and 
NAD-dependent  trimeric  [FeFe]-hydrogenase  was  identified  {Wang,  2013  #226}. 


The  distribution  of  genes  encoding  putative  bifurcating  [FeFe]-hydrogenase  was  strongly 
influenced  by  oxygen  and  salinity  gradients.  High  salinity,  such  as  that  observed  in  along  the 
DWR3  vertical  gradient,  is  generally  unfavorable  to  proteins  due  to  lower  water  activity, 

#  j,  J 

increases  in  the  strength  of  hydrophobic  interactions  during  protein  folding,  and  a  reduction  in 

electrostatic  interactions  in  proteins  residing  in  the  cytoplasm  {Lanyi,  1974  #24;Dennis,  1997 

/  (/ 

#25}.  HydA  recovered  from  the  hypersaline  GSL  water  column  were  enriched  in  hydrophilic 
amino  acids  as  indicated  by  significantly  lower  GRAVY  indices  when  compared  to  HydA 
sequences  obtained  from  lower  salinity  mats  inhabiting  Guerrero  Negro  salterns  (~  8.0%  salt) 
and  Yellowstone  National  Park  hot  springs  (-0.5-3. 1%  salt).  This  finding  is  consistent  with  the 
results  of  previous  investigations  of  the  proteomes  of  a  variety  of  halophilic  and  non-halophilic 
organisms  which  indicate  lower  hydrophobicity  indices  than  for  non-halophilic  counterparts. 
Lower  hydrophobicity  in  proteins  is  a  feature  that  is  hypothesized  to  reduce  the  effects  of  salt- 
driven  non-specific  inter  or  intramolecular  interactions  in  hydrophobic  side-chains,  which  lead  to 


protein  mis-folding  and/or  aggregation  {Reistad,  1970  #38;Lanyi,  1974  #24;Spahr,  1962 
#40;Visentin,  1972  #41;Hutcheon,  2005  #43;Joo,  2005  #44}.  Similar  results  were  also  obtained 
from  a  comparative  genomic  and  proteomic  analysis  of  6  halophilic  and  24  non-halophilic 
isolates,  spanning  both  the  bacterial  and  archaeal  domains,  which  also  indicated  a  lower 
hydrophobicity  for  halophilic  proteins  {Paul,  2008  #42}.  Thus,  the  lower  hydrophobicity  of 
HydA  in  the  GSL  HydA  suggests  that  these  proteins  are  uniquely  adapted  to  the  high  salt 
conditions  present  in  their  natural  environment. 
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HydA  was  identified  in  transects  of  the  DWR3  water  column  where  salt  levels  were  high 
(15-24%)  but  where  dissolved  O2  was  depleted  relative  to  surface  waters.  The  hypoxic  (dCL 
range  0.98  -  2.32  mg/L)  of  the  DWR3  transects  where  hydA  was  detected  implies  either 
anaerobic  microniches  that  allow  anaerobes  to  exist  in  otherwise  sub-oxic  conditions,  a  lower 


effective  exposure  of  the  organism  or  its  oxygen  sensitive  enzymes  to  O2,  or  enhanced  tolerance 
to  O2.  The  solubility  and  bioavailability  of  O2  is  lower  in  hypersaline  solutions  {Lange,  1972 
#237}  which  may  allow  anaerobic  organisms  to  inhabit  otherwise  hypoxicjy^nditions 


dc  cor 

7r 


Intriguingly,  many  of  the  hydrogenases  described  in  this  study  exhibit  novel  substitutions  in  the 
LI  sequence  motifs,  which  are  involved  in  the  coordination  of  the  oxygen-labile  [4Fe-4S] 

subcluster  of  the  H  cluster  of  HydA  {Nicolet,  2000  #99;Nicolet,  1999  #100;Peters,  1998  #101}. 

J  ^ 

Substitutions  in  this  region  may  have  implications  for  the  redox  properties  of  this  [4Fe-4S] 
cluster  and  thus  may  be  important  in  conferring  stability  in  the  presence  of  O2  {Posewitz,  2008 


#12}. 


Most  organisms  that  harbor  a  bifurcating  [FeFe]-hydrogenase  harbor  additional  non¬ 
bifurcating  enzymes  {Calusinska,  2010  #210;Meyer,  2007  #7}.  It  is  therefore  a  surprise  that  the 
majority  of  sequences  recovered  in  the  present  examination  of  HydA  diversity  in  GSL  were 
predicted  to  be  of  the  bifurcating  type.  It  is  possible  that  the  primers  used  in  the  present  study 
are  biased  toward  homologs  that  fonn  trimers  and  that  are  thus  potentially  involved  in 
bifurcation  (e.g.,  TRM3)  {Calusinska,  2010  #210;Meyer,  2007  #7}.  Indeed,  a  prior  comparison 
of  the  primers  used  in  this  study  with  other  primer  designs  found  that  each  is  biased  toward  a 
different  spectrum  of  HydA  diversity  {Schmidt,  2010  #233}.  Application  of  untargeted  shotgun 
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metagenomic  sequencing  approaches,  which  are  currently  ongoing,  will  provide  a  mechanism  to 
1 )  evaluate  the  potential  bias  associated  with  the  recovery  of  predominantly  bifurcating  enzymes 
in  the  present  study  and  2)  to  evaluate  the  predictive  capacity  of  the  phylogenetic  approaches 
used  here  to  assign  secondary  and  quaternary  structure  to  the  recovered  protein  sequences. 


The  detection  of  abundant  putative  trimeric  bifurcating  [FeFe]-hydrogenase  along  the 

#  j,  J 

GSL  DWR3  vertical  gradient  coupled  with  the  numerous  studies  that  continue  to  expand  the 
functional  capabilities  of  these  enzymes  {Schuchmann,  2012  #228;Schut,  2009  #222;Wang, 

/  C/ 

2013  #225;Wang,  2013  #226}  may  have  a  significant  application  in  biofuel  production.  One 

Ot 

limitation  for  the  production  of  highly  reduced  bioproducts  is  that  a  large  proportion  of  the 
metabolic  electron  flow  is  directed  towards  reduction  of  pyridine  nucleotides  (i.e.,  to  NAD(P)H). 
Thermodynamically  the  reduced  pyridine  nucleotides  are  not  sufficient  in  energy  for  the 
production  of  a  number  of  desirable,  highly  reduced  biofuel  products.  [FeFej-hydrogenase 
catalyzed  electron  bifurcation  potentially  overcomes  this  thennodynamic  challenge  by 
combining  electrons  from  the  reduced  pyridine  nucleotides  with  other  lower  potential  electrons 
from  reduced  Fd  {Buckel,  2013  #224;Wang,  2013  #225;Wang,  2013  #226;Schut,  2009  #222}. 
This  increases  the  reducing  potential  of  electrons  from  reduced  pyridine  nucleotides  such  that  a 
larger  flux  is  directed  toward  the  production  of  desired  biofuels  (i.e.,  reduced  products).  That 


other  electron  donors  (e.g.,  CO,  formate)  {Wang,  2013  #225}  can  potentially  feed  into  these 
complexes  expands  the  potential  use  of  these  enzymes  in  the  directing  the  flux  of  metabolic 
electrons  toward  desired  biofuels.  Effort  aimed  at  isolating  representatives  of  the  dominant,  yet 
uncharacterized  organisms  harboring  the  unique  [FeFe]-hydrogenase  enzymes  identified  herein 
should  continue  to  expand  our  understanding  of  the  diversity  of  biochemical  reactions  catalyzed 
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by  this  unique  class  of  enzyme  and  which  may  prove  useful  in  overcoming  key  bioengineering 
barriers. 
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Table  1.  Geographical,  physical,  and  chemical  data  for  the  water  column  depths  sampled". 


Depth  (m) 

Latitude 

Longitude 

Salinity  (ppt) 

Dissolved 

02  (mg/L) 

PAR  (pM) 

Temp  (C) 

pH 

0.0 

41.1674600 

-112.6696117 

149 

5.46 

2079 

24.17 

7.21 

1.0 

41.1674600 

-112.6696117 

151 

5.38 

1219 

24.16 

7.46 

4.0 

41.1674717 

-112.6696117 

143 

5.13 

175 

23.77 

7.83 

6.0 

41.1674717 

-112.6696417 

151 

2.32 

75 

23.37 

8.07 

6.5 

41.1674833 

-112.6696200 

203 

1.68 

44 

20.20b 

7.10b 

8.0 

41.1674817 

-112.6696083 

247 

0.98 

0 

16.59 

5.95 

“Previously  reported  in  Meuser  et  al.,  2013 

bData  points  were  not  measured,  but  were  interpolated  based  on  observed  trends. 
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Figure  1.  Abundance  of  hydA  templates  as  a  function  of  depth  at  DWR3  (A).  Taxonomic 
composition  of  HydA  protein  sequences  as  determined  by  BLASTp  analysis  (B).  Further  details 
of  the  taxonomic  composition  of  HydA  are  located  in  Supp.  Table  1. 


GRAVY 


Figure  2.  Hydropathy  values  associated  with  HydA  from  GSL,  Guererro  Negro  (GN),  and 
Yellowstone  National  Park  (YNP),  as  assessed  using  the  GRAVY  index.  GRAVY  indices  for 
sequences  from  each  environment  were  binned  at  0.04  increments  (rounded  up). 
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Figure  3.  Unique  substitutions  in  the  LI  motif  and  insertions  upstream  from  the  LI  motif 
observed  in  recovered  GSL  HydA  protein  fragments.  HydAl  and  HydA2  from  Chlaymdomonas 
reinhardtii  (Cr_HydAl  and  Cr_HydA2,  respectively)  and  HydAl  from  Clostridium 
pasteurianum  (Cpl)  are  indicated.  Substitutions  and  insertions  are  outlined  above  in  reference  to 
the  Cpl  sequence. 
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Figure  4.  Accessary  cluster  domains  associated  with  diverse  HydA.  The  composition  of  N-  and 
C-  terminal  modules  (F-  and  C-clusters,  respectively)  as  identified  in  Meyer  et  al.,  2007  and  as 
modified  by  Calusinska  et  al.,  2010  are  presented. 


Figure  5.  Bayesian  phylogenetic  reconstruction  of  263  HydA  protein  sequences  compiled  from 
Calusinska  et  al,  2010  and  from  GenBank  (Supp.  Table  3).  The  phylogeny  was  generated  using 
only  the  region  of  HydA  corresponding  to  the  fragment  obtained  from  the  GSL  water  column 
[positions  280  to  419  of  HydA  1  from  Clostridium  pasteurianum  (AAA23248)].  The  primary 
quaternary  structure  of  the  homologs  (monomeric,  dimeric,  trimeric,  tetrameric)  as  outlined  in 
Meyer,  2007  and  more  recently  in  Calusinska  et  ah,  2010  are  overlaid  on  the  phylogeny. 
Architectures  correspond  with  those  presented  in  Fig.  3. 
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Draft  genome  and  targeted  transcriptional  analysis  of  Clostridium  pasteurianum  (strain  W5) 
provides  insights  into  possible  determinants  of  H2  oxidation  and  reduction 


Jesse  B.  Therien,1  Trinity  L.  Hamilton,1  Zhenfeng  Liu,2  Seth  M.  Noone3,  Paul  W.  King3,  Donald 

A.  Bryant,2  John  W.  Peters1# 

Department  of  Chemistry  and  Biochemistry,  Montana  State  University,  Bozeman,  Montana 
5971 71 ;  Department  of  Biochemistry  and  Molecular  Biology,  The  Pennsylvania  State  University, 
University  Park,  Pennsylvania  168022;  National  Renewable  Energy  Laboratory,  Biosciences 

Center,  Golden,  CO  8040 13 


a 


& 


\P 

JO 


<5 


& 


"S' 


46  Abstract 


Clostridium  pasteurianum  (strain  W5)  has  been  the  model  organism  for  the  study  of  nitrogen 
fixation  and  hydrogen  metabolism  for  over  a  century.  The  first  generation  of  biochemical  char¬ 
acterization  of  complex  iron-sulfur  cluster  containing  [FeFe]-hydrogenases  and  Mo-nitrogenase 
were  carried  out  on  enzymes  purified  from  C.  pasteurianum.  It  was  demonstrated  that  twto^ 
[FeFe]-hydrogenases  are  expressed  differentially  under  nitrogen- fixing  and  non  nitrogen- fixing 
conditions.  It  was  proposed  that  the  first  characterized  [FeFe]-hydrogenase  (CpI)  had  a  primary 
role  in  central  metabolism  in  recycling  reduced  electron  carriers  that  accumulated  during  fermen¬ 
tation  under  non  nitrogen-fixing  conditions  via  proton  reduction,  while  the  second  characterized 
hydrogenase,  CpII,  was  proposed  to  have  a  role  in  capturing  reducing  equivalents  lost  as  hydro¬ 


gen  during  nitrogen  reduction.  In  support  of  this,  previous  in  vitro  biochemical  characterization 


of  CpI  and  CpII  indicate  CpI  has  elevated  proton  reduction  activity  in  comparison  to  CpII  and 
CpII  has  elevated  hydrogen  oxidation  in  comparison  to  CpI.  We  have  sequenced  the  genome  of 
C.  pasteurianum  (strain  W5)  to  gain  further  insight  into  the  role  of  hydrogenases  in  metabolism. 
The  genome  sequence  provides  evidence  of  four  hydrogenases  including  a  previously  unidenti¬ 
fied  third  [FeFe]-hydrogenase  and  a  [NiFe] -hydrogenase.  Under  nitrogen-fixing  conditions  we 
have  shown  that  the  transcription  of  an  [FeFe]-  and  the  [NiFe]-hydrogenase  are  increased  in  con¬ 
cert  with  down  regulation  of  the  canonical  [FeFe] -hydrogenase  involved  in  central  metabolism. 
Comparison  of  the  primary  sequences  of  CpI  and  CpII  and  their  close  homologues  provides  the 


basis  for  identifying  key  potential  determinants  of  modulating  hydrogen  production  and  hydro¬ 
gen  oxidation  activities. 
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Clostridium  pasteurianum  (strain  W5)  is  a  gram-positive,  spore-forming,  obligately  anaerobic 
bacteria  found  in  soil  and  was  the  first  free-living  nitrogen-fixing  organism  to  be  isolated  (1). 

For  over  100  years,  it  has  been  a  model  for  studying  the  biochemistry  of  nitrogen  fixation  and 
hydrogen  metabolism.  The  first  generation  of  biochemical  characterization  of  complex  iron- 
sulfur  clusters  from  an  [FeFe]-hydrogenase  and  Mo-nitrogenase  were  carried  out  from  enzymes 
purified  from  C.  pasteurianum.  Following  the  discovery  of  hydrogenase  in  C.  pasteurianum  (2) 
the  first  preparations  of  a  soluble  hydrogenase  were  obtained  from  this  organism  (3,  4).  Subse¬ 
quently,  the  presence  of  a  second  [FeFe]-hydrogenase  (CpII)  was  revealed  (5)  and  its  physical 
and  catalytic  properties  were  studied  along  with  the  first  (CpI)  (6).  Decades  later  the  first  struc¬ 
ture  of  an  [FeFe] -hydrogenase  (CpI)  was  determined  from  C.  pasteurianum  (7). 

The  Clostridia  are  a  diverse  group  of  anaerobes  that  produce  many  fennentative  metabolites.  In 
general,  clostridia  gain  their  energy  from  the  conversion  of  a  hexose  to  butyrate,  acetate,  CO2 
and  H2.  Fermentation  typically  consists  of  the  conversion  of  glucose  to  pyruvate  through  the  gly¬ 
colytic  pathway.  Reduced  NADH  produced  during  glycolysis  can  subsequently  reduce  ferredox- 
in  via  NADH:ferredoxin  oxidoreductase  which  then  can  reduce  protons  to  H2  via  hydrogenase. 

At  this  point,  pyruvate  can  be  drained  away  through  the  action  of  lactate  dehydrogenase,  but  oth¬ 
erwise  is  converted  to  acetyl-CoA  via  pyruvate iferredoxin  oxidoreductase  with  the  concomitant 
production  of  CO2.  During  this  step,  reduced  ferredoxin  can  again  be  used  to  supply  hydrogenase 
with  ability  to  reduce  protons  to  H2.  Acetyl-CoA  is  then  converted  to  either  acetate  or  one  of 


90  many  alcohols. 


91 


92  In  order  to  help  Clostridia  maintain  the  proper  redox  state  during  fermentation,  hydrogenases  re- 

93  versibly  couple  electrons  (i.e.  excess  reducing  equivalents)  with  protons  to  fonn  hydrogen. 


2H  +  2e'  H2 

Many  hydrogenases  preferentially  catalyze  this  reaction  in  one  direction  (8).  Therefore,  it  may 
benefit  an  organism  to  possess  a  hydrogenase  tuned  for  hydrogen  production  and  another  tuned 

A 

for  hydrogen  oxidation,  such  that  either  hydrogenase  could  be  used  dependent  upon  specific 
metabolic  needs.  In  fact,  a  large  number  of  Clostridial  species  have  been  shown  to  have  multiple 
hydrogenases  (8,  9),  including  C.  pasteurianum,  from  which  the  60.5kDa  CpI  and  53  kDa  CpII 
were  first  isolated.  After  its  initial  characterization  (3),  CpI  was  observed  to  produce  ff  in  vitro 
using  methyl  viologen  as  an  electron  carrier  (4),  while  CpII  was  first  thought  to  be  a  unidirec¬ 
tional  H2-oxidizing  hydrogenase  (5).  It  was  later  shown  that  CpI  and  CpII  are  bidirectional  and 
catalyze  both  the  production  and  uptake  of  H2  (10).  A  comparison  of  the  rates  of  H2  evolution 
and  oxidation  revealed  that  CpI  produces  H2  550  times  faster  than  CpII  (5,500  vs.  10  pmol  of 
H2/min*mg  respectively)  while  it  oxidizes  H2  about  30%  slower  than  CpII  (24,000  vs.  34,000 
pmol  of  H2/min*mg  respectively)  (11).  With  the  ability  to  purify  CpI  and  CpII,  it  was  shown 
colorimetrically  that  CpI  contains  20.1  ±  0.7  moles  of  iron  per  mole  of  protein  while  CpII  con¬ 
tains  13.8±0.4(12).  In  both  CpI  and  CpII,  fourteen  of  these  iron  atoms  were  attributed  to  a  six- 
iron  hydrogenase  cluster  (H-cluster  (13)),  thought  to  be  where  catalysis  takes  place,  and  two 
[4Fe4S]  accessory  “ferredoxin”  clusters  (F-clusters  (13))  suspected  of  facilitating  electron  trans¬ 
fer  to  and  from  the  H-cluster  (12,  14).  Using  Resonance  Raman  spectroscopy,  an  additional 
[2Fe2S]  cluster  was  detected  in  CpI,  but  not  CpII  (15).  Thus,  in  total,  CpI  was  shown  to  contain  a 
six-iron  H-cluster  three  [4Fe4S]  clusters  and  a  [2Fe2S]  cluster,  while  CpII  was  lacking  one 


[4Fe4S]  and  the  [2Fe2S]  F-cluster  in  comparison  to  CpI. 
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Hydrogen-producing  hydrogenases  in  anaerobes,  as  mentioned,  are  used  as  a  way  to  dispose  of 
reducing  equivalents  during  fermentation  (16).  This  explains  the  role  of  CpI,  but  does  not  ex¬ 
plain  the  biological  function  of  CpII,  which  we  hypothesize  functions  metabolically  as  an  uptake 
hydrogenase.  Hydrogen  uptake  hydrogenases  are  typically  coupled  to  energy  yield¬ 


ing/conserving  reactions,  for  example,  nitrogen  fixation.  Here,  for  every  mole  of  dinitrogen  re- 
duced,  eight  protons,  eight  electrons,  and  16  ATP  are  used  to  produce  two  moles  of  ammonia 


and  one  mole  of  dihydrogen  (17);  the  reaction  is  shown  below. 


N2  +  8H+  +  8e  +  16MgATP  2NH3  +  H2  +  16MgADP  +  16Pi 


)  iiiwivo  v/i  uimi 

vO 

5> 


SO 


The  high  ATP/electron  demand  of  nitrogen  fixation  can  stress  an  organism  energetically;  there- 
fore  the  presence  of  an  uptake  hydrogenase  is  of  value  in  order  to  recycle  electrons.  Fittingly,  C. 
pasteurianum  is  a  nitrogen  fixer  and  would  benefit  from  an  uptake  hydrogenase  during  nitrogen 


fixation.  In  some  nitrogen  fixing  organisms  such  as  Azotobacter  vinelandii  (18),  Rhodopseu- 
domonas  palustris  (19),  and  others  (20,  21),  uptake  hydrogenases  have  been  shown  to  be  co¬ 
transcribed  with  the  nitrogen  fixing  machinery.  However  without  the  gene  sequence  of  CpII, 


similar  transcriptional  studies  are  not  possible.  Thus,  we  decided  to  sequence  the  genome  of  C. 


pasteurianum.  The  draft  genome  would  not  only  allow  us  to  perform  the  transcriptional  analysis 

vv 

necessary  to  determine  whether  Cpll  indeed  is  upregulated  during  nitrogen  fixation  and  thereby 
recycling  H2  to  conserve  energy,  but  also  allow  us  to  identify  the  gene  context  of  both  CpI  and 
CpII.  Furthermore,  the  gene  context  of  the  previously  sequenced  nitrogenase  (nif)  gene  cluster 
(22)  could  provide  insight  into  the  relationship  between  nitrogenase  and  the  hydrogenases. 


MATERIALS  AND  METHODS 
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Growth  conditions.  Freeze-dried  C.  pasteurianum  W5  (ATCC®  6013™)  was  obtained  from 
ATCC  and  rehydrated  with  Difco™  Reinforced  Clostridial  Medium  following  the  ATCC  proto¬ 
col.  Sealed  25  ml  glass  serum  vials  (Weaton)  containing  10  ml  of  Difco™  Reinforced  Clostridial 
Medium  under  a  headspace  of  10%  H2-10%  CO2-80%  N2  were  then  inoculated  with  the  rehy¬ 
drated  culture  and  incubated  at  37°C  following  ATCC  propagation  procedures  for  this  organism. 
Agar  plates  of  the  above  medium  were  used  to  store  C.  pasteurianum  W5  for  further  use;  initial¬ 
ly  grown  at  37°C  for  24-48  hours  before  being  stored  at  room  temperature  in  an  anaerobic  cham¬ 
ber. 


For  genome  sequencing,  cultures  were  inoculated  from  a  single  colony  from  a  plate  of  Difco™ 
Reinforced  Clostridial  Medium  that  was  inoculated  into  a  sealed  25ml  serum  vials  containing 
10ml  of  the  same  media,  under  a  headspace  of  10%  H2-10%?C02-80%  N2  were  then  incubated 
overnight  at  37°C.  1  ml  of  culture  was  then  spun  down  at  14,000  x  g  at  room  temperature  before 
extracting  DNA. 


hen  spui 


For  qRT-PCR,  cultures  were  begun  in  a  similar  manner  as  above,  after  which  1-5  ml  of  overnight 

/X, 

culture  were  used  to  inoculate  50  ml  of  both  DM-1 1  (23)  (N+;  containing  NH4CI  and 


(NFD3SO4)  and  DM-1 1-N  (N-;  fixed  nitrogen  free)  media  to  an  OD650  =  -0.020  and  were  then 
sparged  with  10%  H2-10%  CO2-80%  N2  for  10  minutes  and  incubated  overnight.  This  was  re¬ 
peated  once  more  after  which  point  a  C2H2  reduction  assay  (24)  was  performed  to  ensure  N2 
fixation  was  taking  place  in  the  N-  culture.  Then,  in  sealed  123  mL  serum  vials,  1  mL  (N+)  was 
added  to  50  mL  of  N+  media,  and  6  mL  of  (N-)  were  added  to  50  mL  of  N-  media,  in  order  to 
make  a  starting  OD65o  =  -0.020  for  each  culture.  The  vials  were  once  again  sparged  with  the  gas 


161  mix  and  incubated  overnight  at  37°C.  The  following  day,  another  C2H2  reduction  assay  was  per- 

162  formed  to  ensure  the  N-  culture  was  fixing  N2.  500  pi  samples  were  mixed  with  1  ml  of  QI- 

163  AGEN  RNAprotect  Bacterial  Reagent®  following  the  included  procedure,  and  either  used  im- 

164  mediately  for  RNA  extraction  or  frozen  at  -20  °C  until  later  RNA  extraction. 

165 

166  Genome  sequencing.  Total  genomic  DNA  of  C.  pasteurianum  W5  was  extracted  using  a 

167  Promega  Wizard®  Plus  SV  minipreps  DNA  purification  system  and  its  concentration  detennined 

168  by  a  NanoDrop  1000  Spectrophotometer  to  be  220  ng/pl  (Abs26o/28o=  2.04).  Genomic  DNA  was 

169  then  submitted  to  the  Genomics  Core  Facility  at  The  Pennsylvania  State  University  for  SOLiD 

170  sequencing  (25).  With  the  help  of  the  Genomics  Core  Facility,  contigs  were  assembled  onto  six 

171  scaffolds,  leaving  thirty  standalone  contigs  that  did  not  initially  fit  onto  the  scaffolds. 

172  (py\ 

173  Gaps  were  closed  by  PCR  using  primers  designed  approximately  200  bases  from  the  end  of  each 

174  contig.  GoTaq®  2x  Master  Mix  reagent  by  Promega  was  used  for  the  PCR  on  a  Techne  Touch- 

175  gene  Gradient  Thennal  Cycler.  Amplicons  were  then  purified  either  directly  using  QIAquick 

176  PCR  Purification  Kit  or  from  agarose  gels  using  the  Qiaex  II  Gel  Extraction  Kit,  both  from  QI- 

177  AGEN.  Purified  PCR  products  were  then  sequenced  by  Davis  Sequencing  in  Davis,  CA.  Se- 

178  quence  data  was  then  assembled  using  BioEdit. 

179 

180  Putative  Gene  annotation  analysis  was  completed  using  the  RAST  (Rapid  Annotation  using  Sub- 

181  system  Technology)  prokaryotic  genome  annotation  server  (26).  The  protein  sequence  of  the  FI- 

182  cluster  domain  of  CpI  was  used  to  BLAST  the  draft  genome  in  RAST  and  all  matches  with  an 


183  expect  value  (E-value)  below  1.0  were  aligned  using  ClustalW  to  detennine  whether  they  con- 

184  tained  previously  published  signature  motifs  LI,  L2,  and  L3  found  in  [FeFej-hydrogenases. 


Preparation  of  total  RNA.  RNA  of  C.  pasteurianum  W5  was  extracted  using  QIAGEN  RNe- 
asy®  Mini  Kit  followed  by  a  DNase  treatment  step  using  RQ1  RNase-Free  DNase  by  Promega 
and  a  re-purification  using  the  QIAGEN  RNeasy®  Mini  Kit.  RNA  concentration  was  then  de¬ 
termined  using  Qubit®  RNA  Assay  Kit  and  was  then  frozen  at  -20  °C  until  further  analysis. 


Quantitative  RT-PCR  (qRT-PCR).  Primers  were  designed  for  the  4  hydrogenases  (CpI,  CpII, 
CpIII  and  the  large  subunit  of  [NiFe]),  3  nitrogenase  a-subunits  (nifD,  vnfD,  and  anj'D)  and  the 
16S  rRNA  small  subunit  using  the  Integrated  DNA  Technologies  SciTools  qPCR  online  primer 


designing  software.  qRT-PCR  was  perfonned  on  a  Rotor-Gene-Q  real-time  PCR  detection  sys¬ 
tem  from  Qiagen  using  the  Power  SYBR®  green  RNA-to-Cj™  1-Step  Kit  from  Applied  Biosys¬ 
tems  according  to  the  included  protocol.  Reactions  were  done  in  triplicate  with  control  reaction 


mixtures  containing  no  reverse  transcriptase 


RESULTS/DISCUSSION 
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Genome.  Initial  sequencing  resulted  in  145  contigs  ranging  from  51 1  to  251,858  bases  yielding 


an  estimated  genome  size  of  4,206,741  bp.  The  average  contig  length  was  29,012  bases  and  the 
average  number  of  reads  per  contig  was  1,618.  As  with  most  other  Clostridial  species  (27)  (28), 


the  GC  content  was  very  low  at  30%. 
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After  closing  90%  (131  of  145)  of  the  gaps,  annotation  analysis  predicted  4347  protein  coding 
sequences  (CDSs),  which  accounted  for  83%  of  the  genome  coverage.  The  Tandem  Repeats  Da¬ 
tabase  v2.30  (23)  indicated  the  presence  of  2 16  tandem  repeats;  the  highest  copy  number  was 
17.5  for  the  2  base  repeat  of  TA  and  the  longest  repeat  was  408  bases  with  2  copies.  There  were 
69  RNA  genes. 


According  to  RAST,  the  closest  neighbors  with  a  completed  genome  were  shown  to  be  numerous 
(twelve)  strains  of  Clostridium  botulinum,  Clostridium  novyi  NT,  and  Clostridium  sporogenes 
ATCC  15579.  C.  botulinum  strains  are  quite  numerous  and  represent  the  largest  species  of  Clos¬ 
tridia  with  complete  genomes;  according  to  NCBI,  13  complete  genomes  and  1 1  partial  genomes 


currently  exist. 


0 


a 


Hydrogenases.  The  genome  confirms  the  presence  of  three  [FeFej-hydrogenases,  CpI,  CpII, 
and  what  we  designate  CpIII,  in  addition  to  one  [NiFej-hydrogenase  and  all  necessary  hydrogen- 
ase  maturation  genes.  All  4  hydrogenases  are  distributed  throughout  the  largest  contig 
(2,726,463  bp)  (Fig.  1).  While  many  [FeFej-hydrogenase-containing  genomes  possess  multiple 
copies  of  structural  genes,  as  exemplified  by  C.  pasteurianum  and  other  clostridia  (9)  (8),  most 


often  there  is  only  one  copy  of  the  maturase  genes  hydE,  hydF,  and  hydG  (7).  Both  HydE  and 
HydG  belong  to  the  radical  S-adenosylmethionine  superfamily  of  proteins  while  HydF  is  a 
GTPase  (31).  The  genes  encoding  the  maturases  can  be  found  together  as  a  cluster  (hydEFG),  as 
two  sets  (hydEF,  hydG  or  hydE,  hydFG),  or  individually  (7)  as  was  observed  in  C.  pasteuri¬ 
anum.  Figure  1  shows  the  delocalization  of  the  maturase  genes  on  the  genome. 


The  encoded  protein  sequence  of  CpII  (458  residues)  is  slightly  shorter  than  that  of  CpI  (574  res¬ 
idues),  while  CpIII  is  shorter  still  (450  residues).  As  seen  in  figure  2,  [FeFej-hydrogenases  con¬ 
tain  the  strictly  conserved  [FeFe]-hydrogenase  H-cluster  signature  motifs;  299CCPx,  254PCxxK, 
and  495ExMxCxxGCxxG  (numbered  according  to  CpI)  (8,  29).  These  motifs  encompass  the  H- 
cluster  binding  cysteines,  denoted  in  figure  2  with  an  H.  previous  phylogenetic  clustering  of 
Clostridial  [FeFe]-hydrogenases  H-cluster  domains  has  shown  a  variety  of  distinct  clusters,  des¬ 
ignated  A1-A5,  A7,  A8,  and  B1-B3  (16).  Group  A2,  where  CpI  is  found,  is  comprised  of  solu¬ 
ble,  hydrogen-producing  enzymes.  Enzymes  of  group  A4,  into  which  CpII  clusters,  contain  a 
long  C-terminal  domain  that  ligates  two  small  rubredoxins  and  one  rubrerythrin  center  (9). 
Members  of  group  B2  such  as  CpIII  have  an  average  size  of  450  aa  and  an  additional  characteris¬ 
tic  cysteine  residue  in  the  PI  motif  (TSCCCPxW)  of  the  H-cluster  (8).  Interestingly,  the  ferre- 
doxin-binding  motif  closest  to  the  N-terminus  of  clostridial  B2  members  is  Cx2Cx-)Cx3C;  this  is 
similar  to  that  of  the  « //-associated  ferredoxin  in  Rhizobium  meliloti  which  is  Cx2Cx8Cx3C  (8). 

The  [NiFe]-hydrogenase  gene  cluster  contains  the  required  accessory  genes  ( HypABCEFD  and 
HoxN),  downstream  of  the  structural  genes,  HyaAB ,  which  encode  the  large  and  small  subunits, 
respectively.  The  protein  sequence  of  the  large  subunit  contains  previously  described  (29)  L 1 


and  L2  signatures  characteristic  of  membrane-bound  H2  uptake  hydrogenases.  The  LI  and  L2 
motifs  encompass  the  highly  conserved  cysteine  pairs  (CxxC)  near  each  terminus  that  ligate  the 
NiFe  center.  Unlike  [FeFej-hydrogenases,  maturases  for  the  [NiFej-hydrogenase  are  often  found 
in  a  single  gene  cluster  with  the  structural  genes,  as  is  the  case  for  the  C.  pasteurianum  genome. 
This  gene  cluster  is  not  in  close  proximity  to  any  other  hydrogenase  or  nitrogenase  genes.  The 


large  subunit  ( HyciB )  clusters  phylogenetically  with  other  Clostridia  in  group  1  (8)  (data  not 
shown)  which  are  membrane-associated  uptake  hydrogenases  (29). 


Transcriptional  analysis.  Transcriptional  experiments  were  repeated  a  minimum  of  three  times 
using  RNA  from  separate  growths.  Magnitudes  of  transcript  abundance  for  the  genes  of  interest 
varied  in  magnitude,  especially  for  nifD  which  varied  over  an  order  of  magnitude  in  one  instance 
(data  not  shown).  Despite  the  variation  in  magnitude,  results  consistently  displayed  a  similar 
trend.  Transcripts  of  the  three  [FeFe]-hydrogenase  genes  of  C.  pasteurianum  all  changed  under 
N2-fixing  conditions.  In  complementary  experiments  performed  by  our  collaborators  at  NREL 
(National  Renewable  Energy  Laboratory,  Golden,  CO),  transcripts  for  the  hydrogenases  and  the 
nitrogenase  in  Clostridium  acetobutylicum  were  monitored  under  nitrogen  fixing  conditions 
yielding  similar  increases  in  nifD  and  the  [NiFe]-hydrogenase.  The  reduction  of  dinitrogen  to 
ammonia  and  the  reduction  of  protons  to  dihydrogen  both  depend  on  reducing  equivalents. 
However,  the  assimilation  of  nitrogen  is  much  more  important  and  the  electron  requirement  of 
nitrogenase  (16  electrons)  far  exceeds  that  of  hydrogenase  (2  electrons).  Therefore,  it  would  be 


in  the  cell’s  interest  shut  down  any  competing  metabolism. 

Under  N2-fixing  conditions  transcripts  of  CpII  were  observed  to  increase  an  average  of  7.4  fold 


(Fig.  3).  Other  experiments  have  demonstrated  that  N2  fixing  cultures  of  C.  pasteurianum  oxi¬ 
dized  twice  as  much  H2  as  cultures  growing  on  media  containing  NH3,  that  the  ratio  of  CpII/CpI 
expressed  in  cultures  fixing  N2  was  50%  higher  than  in  cultures  not  fixing  N2  (5),  and  that  CpII 
had  low  H2  producing  rates  (compared  to  CpI)  and  high  H2  oxidizing  rates  in  vitro  (11).  Togeth¬ 
er,  these  observations  suggest  that  the  increase  in  transcription  of  CpII  likely  leads  to  an  increase 
in  CpII  enzyme,  which  in  turn  increases  the  rate  of  H2  oxidation.  Oxidation  of  H2  then  produces 


reducing  equivalents,  which  would  be  in  demand  during  N2-fixation.  In  contrast  to  CpII,  the 
transcription  of  CpI  and  CpIII  from  cells  grown  in  media  devoid  of  fixed  nitrogen  contained 
fewer  transcripts  (averaging  -2.9  and  -2.0  fold  respectively)  when  compared  to  cells  grown  in 
nitrogen  replete  media,  indicating  that  neither  CpI  nor  CpIII  function  as  uptake  hydrogenases. 


When  compared  to  cultures  grown  in  media  containing  NH3,  qRT-PCR  indicated  an  average  of 
an  8.7  fold  increase  in  the  number  of  transcripts  for  the  large  subunit  of  the  [NiFe]-hydrogenase 
in  cultures  grown  in  the  absence  of  fixed  nitrogen.  In  C.  acetobutylicum,  results  were  nearly 
identical  (figure  3).  The  increase  in  transcription  of  [NiFe]-hydrogenase  is  similar  to  what  was 
found  in  two  unrelated  nitrogen  fixers,  A.  vinelandii  (18)  and  R.  palustris  (19).  Both  studies 
suggested  the  [NiFej-hydrogenases  likely  function  to  recycle  H2  during  N2-fixation. 


Hydrogenases  in  Clostridial  metabolism 


’„TT  r. 


Our  transcriptional  analysis  suggests  that  CpII  and/or  the  [NiFe]-hydrogenase  acts  to  recycle  H2 
produced  as  a  byproduct  of  nitrogen  fixation.  Unlike  CpII,  our  qRTPCR  results  for  CpIII  tran¬ 
scription  provide  little  insight  into  its  metabolic  function.  Figure  5  illustrates  our  proposed  meta¬ 
bolic  scheme  for  CpI,  CpII  and  the  [NiFe]-hydrogenase  during  fermentation  and  nitrogen  fixa¬ 
tion  respectivel 

A  search  through  the  genomes  of  over  40  fully  sequenced  Clostridial  species  using  the  NCBI 
BLAST  tool  showed  that  all  but  two  species  (C.  kluyveri  and  C.  butyricum )  that  possess  nitro- 


genase  also  contain  a  [NiFe]-hydrogenase  (figure  4).  The  two  exceptions  each  have  three  or 
more  [FeFe]-hydrogenases,  suggesting  that  one  or  more  of  them  may  serve  as  an  uptake  hydro- 
genase,  thereby  supplanting  the  need  for  a  [NiFe]-hydrogenase  during  nitrogen  fixation. 


Structural  basis  for  hydrogen  production  /  hydrogen  oxidation  activities 

Given  that  CpI  and  CpII  have  such  dramatic  differences  between  their  abilities  to  produce  and 
oxidize  hydrogen,  we  hypothesize  that  there  must  be  some  structural  features  responsible  for  this 
difference.  Homology  models  of  CpII  (Fig.  2a)  and  CpIII  (not  shown)  generated  using  Swiss- 
Model  (30),  as  well  as  sequence  alignments  and  iron  incorporation  numbers,  indicate  the  absence 
of  accessory  domains  seen  in  CpI.  Starting  at  the  N-terminus  of  CpI,  there  are  conserved  cysteine 
residues  for  each  cluster  sequentially  binding  clusters  FS2,  FS4C,  FS4B  and  FS4A.  The  N- 


tenninus  lacks  residues  responsible  for  binding  accessory  clusters  FS2  and  FS4C  (Fig.  2).  How¬ 
ever,  conserved  regions  binding  the  remaining  [4Fe4S]  and  [2Fe]  clusters  of  the  H-cluster  and 
two  [4Fe4S]  accessory  clusters  have  been  identified.  In  CpIII,  the  N-tenninal  arrangement  of 
cysteines  is  unique.  Sequence  alignment  (Fig.  2)  reveals  that  the  FS4A  binding  motif  is  con¬ 
served,  while  the  FS4B  motif  lacks  two  of  the  four  cysteine  residues  typically  seen  ligating  this 
cluster. 


^  ruui  v  y  oic/ 


Hydrogenases  are  well  known  to  have  differing  FeS  cluster  binding  motifs  (31),  however, 
comparison  among  these  proteins  does  not  illuminate  a  trend  in  hydrogen  oxidation  to  produc¬ 


tion  ratios.  It  is  currently  unclear  to  what  extent  additional  clusters  play  in  the  role  of  kinetic  and 
catalytic  bias,  though  these  clusters  may  be  partially  responsible  for  detennining  catalytic  bias  to 


an  extent  that  is'hot  yet  clear.  Most  likely,  a  suite  of  structural  features  are  responsible  for  tuning 
the  directionality  of  a  given  hydrogenase.  The  particular  amino  acids  involved  in  gas  channel 
lining,  proton  transfer,  electron  transfer,  and  H-cluster  ligand  environment  may  all  play  a  role. 
Interestingly,  in  the  case  of  [NiFe]  hydrogenases,  Leger  et  al  report  that  mutations  in  the  gas 
channel  that  partially  block  the  channel  push  the  catalytic  bias  towards  oxidation,  while  mutating 
an  electron- transferring  FeS  cluster  coordinating  histidine  residue  to  a  glycine  resulted  in  biasing 


the  enzyme  to  favor  hydrogen  production  (32).  The  work  demonstrates  that  changes  far  away 
from  the  active  site  of  the  hydrogenase  may  impact  the  kinetics  of  hydrogenase,  often  in  unpre¬ 
dictable  ways. 

Knorzer  et  a  I  showed  that  the  most  frequent  substitution  for  M3  5  3  in  409  CpI  homologues  was 
T3 5 3,  as  in  CpII,  and  used  site  directed  substitution  to  change  this  residue  from  M3 5 3  to  L353  in 
C.  pasteurianum.  A  significant  decrease  in  H2  production  (15%  of  WT)  and  a  small  decrease  in 
H2  oxidation  (74%  of  WT)  was  observed  and  was  attributed  to  lowered  turnover  rate.  This 
indicates  that  the  residue  at  position  353  is  capable  of  influencing  the  enzymatic  preference  for 
hydrogen  oxidation/production,  with  the  leucine  causing  the  enzyme  to  favor  oxidation  to  a 
greater  degree  relative  to  the  methionine.  EPR  suggested  slight  differences  in  the  elctronic 
structure  and  it  was  thought  this  might  impede  the  the  transition  between  different  redox  states. 
Cyclic  voltametry  of  the  mutated  enzyme  showed  slightly  lowered  Km,  but  similar  catalytic  bias 
for  both  the  forward  and  reverse  reactions.  The  authors  therefore  concluded  that  differences  in 
assay  conditions  led  to  the  observed  difference  in  hydrogen  oxidation/production  compared  to 


Winkler  and  cowworkers  stated  that  it  was  unclear  whether  hydrogenases  with  a  threonine 
corresponding  to  CpI  position  353  were  active,  but  suggested  that  the  enzymes  “might  exhibit 
interesting  features’’  (33).  We  hypothesized  that,  similar  to  Knorzer’s  leucine,  the  biologically- 
relevant  threonine  may  play  a  role  in  tuning  the  catalytic  bias  of  the  enzyme.  Having  made  the 
mutation,  hydrogen  production  and  uptake  assays  simply  showed  a  decrease  in  activity  compared 
to  the  WT,  and  did  not  show  a  significant  change  in  the  ratio  of  oxidation/production  (data  not 


shown). 
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Our  initial  analysis  of  the  C.  pasteurianum  W5  genome  has  identified  a  [NiFe]-hydrogenase  and 
confirmed  the  presence  of  a  third  [FeFe]-hydrogenase.  It  is  quite  common  for  Clostridia  to  pos¬ 
sess  numerous  [FeFe]-hydrogenases  and  the  majority  of  sequenced  species  have  a  [NiFej- 
hydrogenase.  Fewer  of  the  sequenced  Clostridia  have  a  nitrogenase.  Out  of  the  40  Clostridial 
species  we  examined,  nearly  all  nitrogenase-containing  clostridia  also  had  a  [NiFe]-hydrogenase. 
Furthermore,  the  observed  increase  in  transcription  of  [NiFej-hydrogenase  under  N2-fixing  con¬ 
ditions  in  both  C.  pasteurianum  and  C.  acetobutylicum  suggests  the  [NiFej-hydrogenase  is  in¬ 
volved  in  recycling/uptake  of  H2  produced  as  a  byproduct  during  the  nitrogenase-catalyzed  re¬ 
duction  of  atmospheric  N2  to  NFLt+.  Biochemical  and  transcriptional  data  for  CpII  similarly  sug¬ 
gests  that  CpII  is  involved  in  FF  uptake  during  nitrogen  fixing  conditions.  So  far,  the  use  of 
CpIII  by  C .  pastuerianum  is  unclear,  and  additional  characterization  of  this  enzyme  may  clarify 
its  function.  Based  on  the  observed  characteristics  of  CpI  and  CpII,  as  well  as  other  evidence  in 
the  literature,  we  believe  that  it  may  be  possible  to  tune  the  catalytic  directionality  of  [FeFe]  hy- 

mutations  to  CpI. 
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106  Figure  Legends. 

107 

108  Figure  1.  Draft  genome  of  Clostridium  pasteurianum  and  location  of  hydrogenase  and  nitrogen- 

109  ase  genes  and  gene  clusters.  CpI,  CpII,  and  CpIII  indicate  the  [FeFe]-hydrogenases.  nif  cluster 

110  refers  to  the  Mo-dependent  nitrogenase  while  the  vnf  and  anf  clusters  refer  to  the  V-dependent 

111  and  Fe-only  nitrogenases  respectively. 

112 

113  Figure  2.  Protein  sequence  alignment  of  the  [FeFe]-hydrogenases  of  C.  pasteurianum  and  C. 

114  acetobutylicum.  FeS  cluster  ligating  residues  indicated  with  A,  B,  C,  or  2  to  denote  the  cluster 

115  they  ligate;  H  -  the  H-cluster,  A  -  proximal  cluster  (FS4A),  B  -  FS4B,  C  -  FS4C,  and  2  -  FS2. 

116  Key  residues  within  a  5  angstrom  radius  of  the  H-cluster  are  denoted  with  a  star.  These  residues 


are  conserved  among  representatives  of  phylogentically  clustering  members  of  groups  containing 


CpI/Cal,  CpII  and  CpIII/Calll. 


<Zr 


Figure  2a.  Residues  that  may  play  an  important  role  in  “tuning”  the  H-cluster  of  CpI  and  CpII. 


Figure  3.  Fold  change  of  hydrogenase  transcript  abundance  in  C.  pasteurianum  and  C.  acetobu- 
tylicum  under  N2-fixing  conditions  as  determined  by  qRT-PCR. 
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Figure  4.  Presence  of  nitrogenase  and  hydrogenase  genes  in  sequenced  clostridial  species. 
[FeFe]-hydrogenases  100%  (Red),  [NiFe] -hydrogenase  55%  (Blue),  Mo-nitrogenase  37%  and 
1 1%  (large  and  small  purple  respectively),  and  alternative  nitrogenases  3%  and  3%  (green). 
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129  Figure  5.  Proposed  metabolic  function  of  CpI,  CpII  and  the  [NiFe]-hydrogenase  in  C.  pasteuri- 
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